Función Datastore

Lea recopilaciones de datos de gran tamaño

La función datastore crea un almacén de datos, que es un repositorio de recopilaciones de datos que, por su gran tamaño, no caben en la memoria. Un almacén de datos permite leer y procesar los datos almacenados en varios archivos de un disco, una ubicación remota o una base de datos como una entidad única. Si el tamaño de los datos es demasiado grande para la capacidad de la memoria, usted puede administrar la importación incremental de datos, crear un arreglo tall (alto) para trabajar con los datos o utilizar el almacén de datos como entrada para que mapreduce continúe con el procesamiento. Para obtener más información, consulte Introducción a los almacenes de datos.

Funciones

expandir todo

Crear un almacén de datos

`datastore`	Create datastore for large collections of data
`tabularTextDatastore`	Datastore for tabular text files
`spreadsheetDatastore`	Datastore for spreadsheet files
`imageDatastore`	Datastore for image data
`parquetDatastore`	Datastore for collection of Parquet files
`fileDatastore`	Datastore with custom file reader
`arrayDatastore`	Datastore for in-memory data

Leer y escribir desde un almacén de datos

`read`	Read data in datastore
`readall`	Read all data in datastore
`preview`	Preview subset of data in datastore
`hasdata`	Determine if data is available to read
`reset`	Reset datastore to initial state
`writeall`	Write datastore to files

Subdividir, hacer particiones o reorganizar el almacén de datos

`subset`	Create subset of datastore or FileSet
`isSubsettable`	Determine whether datastore is subsettable (Desde R2022b)
`shuffle`	Shuffle all data in datastore
`isShuffleable`	Determine whether datastore is shuffleable
`numpartitions`	Number of datastore partitions
`partition`	Partition a datastore
`isPartitionable`	Determine whether datastore is partitionable

Combinar o transformar almacenes de datos

Funciones

`combine`	Combine data from multiple datastores
`transform`	Transform datastore

Objetos

`CombinedDatastore`	Datastore to combine data read from multiple underlying datastores
`SequentialDatastore`	Sequentially read data from multiple underlying datastores (Desde R2022b)
`TransformedDatastore`	Datastore to transform underlying datastore

Integrar con MapReduce y arreglos altos

`KeyValueDatastore`	Datastore for key-value pair data for use with `mapreduce`
`TallDatastore`	Datastore for checkpointing `tall` arrays

Clases

expandir todo

Desarrollar un almacén de datos personalizado

`matlab.io.Datastore`	Base datastore class
`matlab.io.datastore.Partitionable`	Add parallelization support to datastore
`matlab.io.datastore.Subsettable`	Add subset and fine-grained parallelization support to datastore (Desde R2022b)
`matlab.io.datastore.HadoopLocationBased`	Add Hadoop support to datastore
`matlab.io.datastore.Shuffleable`	Add shuffling support to datastore
`matlab.io.datastore.DsFileSet`	File-set object for collection of files in datastore
`matlab.io.datastore.DsFileReader`	File-reader object for files in a datastore
`matlab.io.datastore.FileWritable`	Add file writing support to datastore
`matlab.io.datastore.FoldersPropertyProvider`	Add Folder property support to datastore
`matlab.io.datastore.FileSet`	File-set for collection of files in datastore
`matlab.io.datastore.BlockedFileSet`	Blocked file-set for collection of blocks within file

Temas

Introducción a los almacenes de datos
Un almacén de datos es un objeto para la lectura de un único archivo o una recopilación de archivos o datos.
Select Datastore for File Format or Application
Choose the right datastore based on the file format of your data or application.
Read and Analyze Large Tabular Text File
Create a datastore for a large text file containing tabular data, and then read and process the data one block at a time or one file at a time.
Read and Analyze Image Files
This example shows how to create a datastore for a collection of images, read the image files, and find the images with the maximum average hue, saturation, and brightness (HSV).
Read and Analyze MAT-File with Key-Value Data
This example shows how to create a datastore for key-value pair data in a MAT-file that is the output of mapreduce.
Read and Analyze Hadoop Sequence File
This example shows how to create a datastore for a Sequence file containing key-value data.
Trabajar con datos remotos
Trabaje con datos remotos en Amazon S3™, Azure^® Blob Storage o HDFS™.
Set Up Datastore for Processing on Different Machines or Clusters
Setup a datastore on your machine that can be loaded and processed on another machine or cluster.
Develop Custom Datastore
Create a fully customized datastore for your custom or proprietary data.
Develop Custom Datastore for DICOM Data
This example shows how to develop a custom datastore that supports writing operations.
Testing Guidelines for Custom Datastores
After implementing your custom datastore, follow this test procedure to qualify your custom datastore.

Función Datastore

Funciones

Crear un almacén de datos

Leer y escribir desde un almacén de datos

Subdividir, hacer particiones o reorganizar el almacén de datos

Combinar o transformar almacenes de datos

Funciones

Objetos

Integrar con MapReduce y arreglos altos

Clases

Desarrollar un almacén de datos personalizado

Temas

Información relacionada