Spark Basics
Useful Links
- Ampcamp big data bootcamp
- RDDs Simplified
- Elasticsearch and Apache Lucene for Apache Spark and MLlib
- Spark on AWS
- Running Apache Spark on AWS
- Running Apache Spark EMR and EC2 scripts on AWS with read write S3
- Spark on EMR - How to Submit a Spark Application with EMR Steps
- Databricks Reference Apps
- Introduction to Apache Spark with Examples and Use Cases
Spark and MongoDB
Spark and NLP
Here is a complete set of examples showing how to use DL4J (Deep Learning for Java), which uses UIMA, on the Spark platform,
and the following project shows how to use the cTAKES UIMA module from within the Spark framework.
Natural Language Processing with Apache Spark
GraphX
Apache Zeppelin
Connect to Zeppelin using the same SSH tunneling method used to reach the other web servers on the master node. The Zeppelin server listens on port 8890.
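
As a rough illustration of the tunneling step, the sketch below builds an `ssh` local port forwarding command for port 8890. It assumes plain local forwarding (`-L`), which may differ from the tunneling method you use for the other web servers. The host name `master-node.example.com`, the key path `/path/to/key.pem`, and the `hadoop` login user are hypothetical placeholders; substitute the values for your own cluster.

```python
import shlex

ZEPPELIN_PORT = 8890  # port stated in the text above


def build_tunnel_command(master_host, key_path, user="hadoop",
                         local_port=ZEPPELIN_PORT, remote_port=ZEPPELIN_PORT):
    """Return the ssh argument list that forwards local_port on this
    machine to remote_port on the master node.

    The default user "hadoop" is an assumption; use your cluster's login user.
    """
    return [
        "ssh",
        "-i", key_path,                                 # private key for the master node
        "-N",                                           # no remote command, tunnel only
        "-L", f"{local_port}:localhost:{remote_port}",  # local port forwarding
        f"{user}@{master_host}",
    ]


if __name__ == "__main__":
    # Hypothetical host and key path, for illustration only.
    cmd = build_tunnel_command("master-node.example.com", "/path/to/key.pem")
    print(shlex.join(cmd))
```

Run the printed command in a terminal and leave it open. Zeppelin should then be reachable at http://localhost:8890 in a local browser.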