Can anyone share Java code for reading a Hive database with Spark?

liprais

2016-03-18 09:26:16 +08:00

@anonymoustian

SparkConf conf = new SparkConf().setAppName("JavaRFormulaExample");
JavaSparkContext jsc = new JavaSparkContext(conf);
HiveContext hiveContext = new HiveContext(jsc);

anonymoustian

2016-03-18 10:57:15 +08:00

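@liprais Sorry, I already know that part, but could you give an example of the setup needed beforehand and of the reading and writing afterwards? Right now I have no idea how to read data from Hive and print it out, so I'm still a bit lost.

Thanks.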

liprais

2016-03-18 11:02:26 +08:00

@anonymoustian
"Configuration of Hive is done by placing your hive-site.xml, core-site.xml (for security configuration), hdfs-site.xml (for HDFS configuration) file in conf/. Please note when running the query on a YARN cluster (cluster mode), the datanucleus jars under the lib directory and hive-site.xml under conf/ directory need to be available on the driver and all executors launched by the YARN cluster. The convenient way to do this is adding them through the --jars option and --file option of the spark-submit command."
Just copy those three files (hive-site.xml, core-site.xml, hdfs-site.xml) into Spark's conf directory.
After that, the code for reading and writing looks like this:

// sc is an existing JavaSparkContext; sc.sc() returns the underlying SparkContext.
HiveContext sqlContext = new org.apache.spark.sql.hive.HiveContext(sc.sc());

// Queries are expressed in HiveQL.
sqlContext.sql("select * from YOUR_HIVE_TABLE_NAME").collect();

anonymoustian

2016-03-18 12:47:30 +08:00

@liprais Where is Spark's conf directory?

liprais

2016-03-18 13:07:48 +08:00

@anonymoustian
YOUR_SPARK_HOME/conf
Just copy them into this directory.
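
One way to confirm the three files actually landed in that directory is a small check like the sketch below. It uses only the Java standard library and assumes the SPARK_HOME environment variable points at the Spark installation (or that the Spark home path is passed as the first argument). The class name HiveConfCheck and the method missingFiles are made up for illustration and are not part of Spark.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Spark: reports which Hive-related
// config files are absent from a Spark conf directory.
class HiveConfCheck {
    // The three files named in the thread.
    static final List<String> REQUIRED =
            Arrays.asList("hive-site.xml", "core-site.xml", "hdfs-site.xml");

    // Returns the required file names that are not regular files in confDir.
    static List<String> missingFiles(Path confDir) {
        List<String> missing = new ArrayList<>();
        for (String name : REQUIRED) {
            if (!Files.isRegularFile(confDir.resolve(name))) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Assumption: SPARK_HOME is set, or the Spark home is given as args[0].
        String home = args.length > 0 ? args[0] : System.getenv("SPARK_HOME");
        if (home == null) {
            System.err.println("Set SPARK_HOME or pass the Spark home directory as an argument.");
            return;
        }
        Path confDir = Paths.get(home, "conf");
        List<String> missing = missingFiles(confDir);
        System.out.println(missing.isEmpty()
                ? "All Hive config files found in " + confDir
                : "Missing from " + confDir + ": " + missing);
    }
}
```

If the check reports nothing missing and queries still fail, the cause is more likely the contents of hive-site.xml (for example the metastore settings) than the file locations.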
