1 | initial version |
Here is one approach to transforming a dictionary presented as a string into a structured DataFrame in Scala:
import spray.json._
val jsonString = "{ \"name\":\"John\", \"age\":30, \"city\":\"New York\" }"
val jsonMap = jsonString.parseJson.convertTo[Map[String, Any]]
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.SparkSession
val spark = SparkSession.builder.appName("DictionaryStringToDataFrame").getOrCreate()
val rdd = spark.sparkContext.parallelize(Seq(jsonMap))
import org.apache.spark.sql.DataFrame
val df = spark.createDataFrame(rdd)
df.printSchema()
df.show()
The output should look something like this:
root
|-- age: integer (nullable = false)
|-- city: string (nullable = true)
|-- name: string (nullable = true)
+---+--------+----+
|age| city|name|
+---+--------+----+
| 30|New York|John|
+---+--------+----+