Skip to content

Input and Output Files

parquet-java defines OutputFile and InputFile interfaces to interact with files. Originally, it only provided HadoopOutputFile and HadoopInputFile implementations that were capable of working with Hadoop and local files.

This required a Hadoop dependency to be included in the project. This is not ideal for projects that only need to work with local files, as it adds unnecessary complexity and size to the project. To address this, Parquet Java recently added LocalOutputFile and LocalInputFile implementations.

Before these classes were created, Carpet provided a local file implementation with FileSystemOutputFile and FileSystemInputFile. You can use either implementation.