Accessing ORC data
>>> spark.sql("select score_date,count(*) from schema.table").show()
+----------+--------+
|score_date|count(1)|
+----------+--------+
+----------+--------+
So this query does not print any output, as it was accessing ORC data
Workaround
>>> spark.sql("SET spark.sql.hive.convertMetastoreOrc=false")
DataFrame[key: string, value: string]
>>> spark.sql("select score_date,count(*) from schema.table").show()
+----------+--------+
|score_date|count(1)|
+----------+--------+
|x | y|
|x | y|
You can also use Hive warehouse connector, you can find more details below
Sign up here with your email
ConversionConversion EmoticonEmoticon