fileName | String Name of the Parquet file to which the records will be written. | |
schema | String Optional. A Parquet schema against which the documents are matched before the records are written to the file. If a schema is not provided, an IS document type must be provided using the docTypeName parameter. If any of the provided documents does not conform exactly to the given schema, an exception is thrown. | |
docTypeName | String Optional. The fully qualified name of an IS document type. A document type can be provided instead of the schema parameter. The document type is converted to a Parquet schema internally, and that schema is used for converting the records and writing the Parquet file. The mapping tables below (the basic type mapping and Mapping of Integration Server data types to Parquet Logical types) list how Integration Server data types map to Parquet schema types. Note: Either schema or docTypeName must be provided so that the data can be validated before it is written to the Parquet file. | |
records | Document List Array of IData objects to be written to the Parquet file. | |
options | Document Optional. Options, such as the compression method, that can be passed to this service. | |
compressionCodec | String Optional. The following compression methods are supported: gzip, snappy, uncompressed. Note: If a compressionCodec is not provided, the data is not compressed. |
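
The parameter rules above can be summarized as a small pre-flight check. The following Python sketch is illustrative only and is not part of Integration Server: the function name `check_write_inputs` and its return value are hypothetical, and it assumes that compressionCodec is a key inside the options document and that codec names are matched exactly as listed.

```python
# Hypothetical helper: mirrors the documented parameter rules, nothing more.
SUPPORTED_CODECS = {"gzip", "snappy", "uncompressed"}


def check_write_inputs(file_name, records, schema=None, doc_type_name=None, options=None):
    """Validate inputs the way the parameter table describes them."""
    if not file_name:
        raise ValueError("fileName is required")
    # Either a schema or a document type name must be present for validation.
    if schema is None and doc_type_name is None:
        raise ValueError("either schema or docTypeName must be provided")
    # Assumption: compressionCodec is carried inside the options document.
    codec = (options or {}).get("compressionCodec")
    if codec is None:
        codec = "uncompressed"  # no codec given: data is not compressed
    if codec not in SUPPORTED_CODECS:
        raise ValueError("unsupported compressionCodec: " + codec)
    return {"fileName": file_name, "codec": codec, "recordCount": len(records)}
```

The sketch does not decide what happens when both schema and docTypeName are supplied, because the parameter descriptions do not say.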
Integration Server Type | Parquet Basic Type |
String | STRING |
String List | Repeated STRING |
String Table | BINARY |
Document | Group |
Document List | Repeated Group |
Document Reference | Group (flatten) |
Document Reference List | Repeated Group (flatten) |
java.lang.Boolean | BOOLEAN |
java.lang.Integer | INT32 |
java.lang.Long | INT64 |
java.lang.Float | FLOAT |
java.lang.Double | DOUBLE |
java.util.Date | BINARY |
java.lang.Byte | BINARY |
java.lang.Short | BINARY |
byte[] | BINARY |
Object (unidentified) | BINARY |
Object List (unidentified) | Repeated BINARY |
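
For quick lookups, the basic type table above can be transcribed into a dictionary. This Python sketch is only a convenience for readers; the names `BASIC_TYPE_MAP`, `parquet_basic_type`, and `binary_fallback_types` are hypothetical and do not correspond to any Integration Server API. The values are copied from the table as written.

```python
# Transcription of the basic type table above; keys are Integration Server
# type names, values are the table's Parquet entries.
BASIC_TYPE_MAP = {
    "String": "STRING",
    "String List": "Repeated STRING",
    "String Table": "BINARY",
    "Document": "Group",
    "Document List": "Repeated Group",
    "Document Reference": "Group (flatten)",
    "Document Reference List": "Repeated Group (flatten)",
    "java.lang.Boolean": "BOOLEAN",
    "java.lang.Integer": "INT32",
    "java.lang.Long": "INT64",
    "java.lang.Float": "FLOAT",
    "java.lang.Double": "DOUBLE",
    "java.util.Date": "BINARY",
    "java.lang.Byte": "BINARY",
    "java.lang.Short": "BINARY",
    "byte[]": "BINARY",
    "Object (unidentified)": "BINARY",
    "Object List (unidentified)": "Repeated BINARY",
}


def parquet_basic_type(is_type):
    """Return (parquet_type, repeated) for an Integration Server type name."""
    entry = BASIC_TYPE_MAP[is_type]  # KeyError for types the table does not list
    prefix = "Repeated "
    if entry.startswith(prefix):
        return entry[len(prefix):], True
    return entry, False


def binary_fallback_types():
    """List the non-repeated types that the table stores as plain BINARY."""
    return sorted(t for t, p in BASIC_TYPE_MAP.items() if p == "BINARY")
```

Calling `binary_fallback_types()` makes it easy to see that java.util.Date, java.lang.Byte, and java.lang.Short are written as BINARY rather than as a date or integer type, which matters when another tool reads the file.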
Integration Server Type | Parquet Logical Type | Parquet Basic Type |
String | ENUM | BINARY (ENUM) |
String | JSON | BINARY (JSON) |
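
In Parquet's textual schema notation, a logical type is written in parentheses after the field name of a binary field. The sketch below builds such field declarations for the two logical types in the table above. It is a minimal illustration under assumptions: the helper names, the message name `order`, and the field names are hypothetical, and the source does not confirm that the schema parameter expects exactly this text form.

```python
LOGICAL_TYPES = {"ENUM", "JSON"}  # the two annotations listed in the table above
REPETITIONS = {"required", "optional", "repeated"}


def annotated_binary_field(name, logical_type, repetition="optional"):
    """Build one binary field declaration annotated with a logical type."""
    if logical_type not in LOGICAL_TYPES:
        raise ValueError("logical type not in the mapping table: " + logical_type)
    if repetition not in REPETITIONS:
        raise ValueError("unknown repetition: " + repetition)
    return f"{repetition} binary {name} ({logical_type});"


def message_schema(message_name, fields):
    """Join field declarations into a Parquet message definition string."""
    body = "\n".join("  " + annotated_binary_field(*f) for f in fields)
    return f"message {message_name} {{\n{body}\n}}"


# Hypothetical example: an enum-like status and a JSON payload, both Strings
# on the Integration Server side.
example = message_schema("order", [("status", "ENUM", "required"), ("payload", "JSON")])
```

Both fields remain String values in the IS document; only the Parquet annotation tells readers of the file how to interpret the bytes.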