Ask Your Question

Revision history [back]

click to hide/show revision 1
initial version

To display the structure of a parquet file, including the data type of each column, you can use the printSchema() method in Apache Spark. Here's an example:

val df = spark.read.parquet("path/to/parquet/file")
df.printSchema()

Alternatively, you can use the parquet-tools command-line tool to display the schema of a parquet file. Here's an example:

parquet-tools schema path/to/parquet/file

This will output the schema of the parquet file in JSON format.