Script to access Amazon S3 files

29 visualizaciones (últimos 30 días)
André Bibaud
André Bibaud el 25 de Sept. de 2017
Comentada: Stephane el 23 de Dic. de 2022
I setup a cluster on AWS EC2 to run my scripts faster. Right now my script load input variables locally (on my C: drive) and write simulation results also locally. I setup an S3 bucket with a sub-directory for my input variables and a sub-directory for the results. How do I change the load and write function to point to the S3 bucket instead of my c: drive?
Thanks

Respuestas (3)

Jhon Wine
Jhon Wine el 24 de En. de 2018
I found datastore to be very easy to use. Run the code below in each worker.
%Set S3 Credentials
setenv('AWS_ACCESS_KEY_ID', 'ID');
setenv('AWS_SECRET_ACCESS_KEY','Key');
setenv('AWS_REGION', 'us-west-2');
%Load Data
fp= 's3://mybucket/data/file.data';
ds=fileDatastore(spectralFilePath,'ReadFcn',@AWSRead);
data=ds.read;
Also define this function
function data= AWSRead(fileName)
fid = fopen(fileName);
data= fread(fid,inf,'short');
fclose(fid);
end
Make sure the EC2 cloud is on the same site as S3 to avoid data transfer bottlenecks

David Fink
David Fink el 28 de Sept. de 2017
Since S3 uses a RESTful architecture, use 'webread' and 'webwrite' to access the S3 bucket.
  1 comentario
Jhon Wine
Jhon Wine el 17 de Nov. de 2018
Is there an example, how to use webread and webwrite with S3? There is no good tutorial..

Iniciar sesión para comentar.


Harish Sundaresh
Harish Sundaresh el 26 de Jun. de 2021
can MATLAB read .json files from aws s3? I get these errors
%% webread
webread(fname)
Error using webread (line 119)
The 's3' protocol specified in URL, 'url', is not supported. Specify the URL with the protocol 'http://' or 'https://'.
%% jsondecode
Error using jsondecode
JSON syntax error: expected value but found end of text.
  2 comentarios
Dmitry Kaplan
Dmitry Kaplan el 8 de Feb. de 2022
I could not get webread to work. I am sure there is a way, but I could not figure it out.
Instead, I use something like this
fds = fileDatastore('s3://blah/blah/filename.json','FileExtensions',{'.json'},'readfcn',@fileread);
file_data = read(fds);
This defines the read function for the datastore to be fileread(). Invoking the read() actually runs fileread() and passes to it the filename -- because the fds.ReadMode is 'file'.
Stephane
Stephane el 23 de Dic. de 2022
Thanks for sharing this Dmitry

Iniciar sesión para comentar.

Etiquetas

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by