parquetinfo
Get information about Parquet file
Description
ParquetInfo
objects contain information about a Parquet file,
such as: file size, variable names and types, encoding, and compression schemes. To get
information about a Parquet file, create the ParquetInfo
object using the
parquetinfo
function.
Creation
Description
Input Arguments
Name of Parquet file, specified as a character vector or string scalar.
parquetinfo
works with Parquet 1.0 or Parquet 2.0 files.
Depending on the location of the file, filename
can take on one of
these forms.
Location | Form | ||||||||
---|---|---|---|---|---|---|---|---|---|
Current folder or folder on the MATLAB® path | Specify the name of the file in
Example:
| ||||||||
File in a folder | If the file is not in the current folder or in a folder on the MATLAB path, then specify the full or relative path name. Example:
Example:
| ||||||||
Internet URL | If the file is specified as an internet uniform resource locator (URL),
then Example:
| ||||||||
Remote Location | If the file is stored at a remote location, then
Based on the remote location,
For more information, see Work with Remote Data. Example:
|
Data Types: char
| string
Properties
This property is read-only.
Absolute path to Parquet file, specified as a string scalar.
Data Types: string
This property is read-only.
File size in bytes, specified as double
.
Data Types: double
This property is read-only.
Number of row groups, specified as a double
.
Data Types: double
This property is read-only.
Number of rows in each row group, specified as a double
.
Data Types: double
This property is read-only.
Variable names, specified as a string array. If the Parquet file contains
N
variables, then VariableNames
is an array of
size 1
-by-N
containing the names of the
variables.
Data Types: string
This property is read-only.
Variable data types, specified as a string array. If the Parquet file contains
N
variables, then VariableTypes
is an array of
size 1
-by-N
containing datatype names for each
variable. Each element in the array is the name of the MATLAB datatype to which the corresponding variable in the Parquet file
maps.
Data Types: string
This property is read-only.
Variable compression algorithm, specified as a string array. If the Parquet file
contains N
variables, then VariableCompression
is
an array of size 1
-by-N
containing compression
algorithm names. Each element in the array corresponds to the compression algorithm used
to compress that variable in the Parquet file. See parquetwrite
for a list of
supported compression algorithms.
Data Types: string
This property is read-only.
Variable encoding, specified as a string array. If the Parquet file contains
N
variables, then VariableEncoding
is an array
of size 1
-by-N
containing encoding scheme names.
Each element in the array corresponds to the encoding scheme used to encode that
variable in the Parquet file. See parquetwrite
for a list of
supported encodings.
Data Types: string
This property is read-only.
Parquet version, specified as either "1.0"
or
"2.0"
.
Data Types: string
Examples
Use the parquetinfo
function to create a ParquetInfo
object containing information about the file.
info = parquetinfo('outages.parquet')
info = ParquetInfo with properties: Filename: "/mathworks/devel/bat/filer/batfs2566-0/Bdoc25b.2988451/build/runnable/matlab/toolbox/matlab/demos/outages.parquet" FileSize: 44202 NumRowGroups: 1 RowGroupHeights: 1468 VariableNames: ["Region" "OutageTime" "Loss" "Customers" "RestorationTime" "Cause"] VariableTypes: ["string" "datetime" "double" "double" "datetime" "string"] VariableCompression: ["snappy" "snappy" "snappy" "snappy" "snappy" "snappy"] VariableEncoding: ["plain" "plain" "plain" "plain" "plain" "plain"] Version: "2.0"
Display the name, type, and compression scheme for the third variable in the file.
disp([info.VariableNames(3) info.VariableTypes(3) info.VariableCompression(3)])
"Loss" "double" "snappy"
Extended Capabilities
This function fully supports thread-based environments. For more information, see Run MATLAB Functions in Thread-Based Environment.
Version History
Introduced in R2019aThis function supports thread-based environments.
See Also
MATLAB Command
You clicked a link that corresponds to this MATLAB command:
Run the command by entering it in the MATLAB Command Window. Web browsers do not support MATLAB commands.
Seleccione un país/idioma
Seleccione un país/idioma para obtener contenido traducido, si está disponible, y ver eventos y ofertas de productos y servicios locales. Según su ubicación geográfica, recomendamos que seleccione: .
También puede seleccionar uno de estos países/idiomas:
Cómo obtener el mejor rendimiento
Seleccione China (en idioma chino o inglés) para obtener el mejor rendimiento. Los sitios web de otros países no están optimizados para ser accedidos desde su ubicación geográfica.
América
- América Latina (Español)
- Canada (English)
- United States (English)
Europa
- Belgium (English)
- Denmark (English)
- Deutschland (Deutsch)
- España (Español)
- Finland (English)
- France (Français)
- Ireland (English)
- Italia (Italiano)
- Luxembourg (English)
- Netherlands (English)
- Norway (English)
- Österreich (Deutsch)
- Portugal (English)
- Sweden (English)
- Switzerland
- United Kingdom (English)