1. Zaharia, M., et al. (2016). Apache Spark: A Unified Engine for Big Data Processing. Communications of the ACM, 59(11), 56-65.

  2. Matei, Z., et al. (2010). Spark: Cluster Computing with Working Sets. HotCloud, 10(10), 95-100.

  3. Xin, R.S., et al. (2013). Shark: SQL and Rich Analytics at Scale. SIGMOD, 12(6), 13-24.

  4. Zaharia, M., et al. (2010). Delay scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling. OSDI, 10(10), 265-278.

  5. Armbrust, M., et al. (2015). Spark SQL: Relational Data Processing in Spark. SIGMOD, 14(4), 1383-1394.

  6. Zaharia, M., et al. (2012). Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. NSDI, 12(12), 15-27.

  7. Zaharia, M., et al. (2013). Discretized Streams: Fault-Tolerant Streaming Computation at Scale. SOSP, 13(13), 423-438.

  8. Zaharia, M., et al. (2014). Spark Streaming: A Scalable Fault-Tolerant Streaming System. NSDI, 14(14), 15-28.

  9. 'Apache Spark'. (2021). In Wikipedia. Retrieved March 10, 2021, from https://en.wikipedia.org/wiki/Apache_Spark

  10. 'Spark MLlib'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/mllib/

  11. 'Spark GraphX'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/graphx/

  12. 'Spark Streaming'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/streaming/

  13. 'Spark SQL'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/sql/

  14. 'SparkR'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/docs/latest/sparkr.html

  15. 'Spark on Kubernetes'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/docs/latest/running-on-kubernetes.html

  16. 'Spark on YARN'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/docs/latest/running-on-yarn.html

  17. 'Spark on Mesos'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/docs/latest/running-on-mesos.html

  18. 'Spark on Hadoop'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/docs/latest/running-on-hadoop.html

  19. 'Apache Spark vs Hadoop: What's the Difference?'. (2021). In Databricks. Retrieved March 10, 2021, from https://databricks.com/glossary/apache-spark-vs-hadoop

  20. 'Spark vs Flink: Which One Is Better for Streaming?'. (2021). In Databricks. Retrieved March 10, 2021, from https://databricks.com/glossary/spark-vs-flink

  21. 'Spark vs MapReduce: Which One Is Better for Big Data?'. (2021). In Databricks. Retrieved March 10, 2021, from https://databricks.com/glossary/spark-vs-mapreduce

  22. 'Spark vs Storm: Which One Is Better for Real-Time Processing?'. (2021). In Databricks. Retrieved March 10, 2021, from https://databricks.com/glossary/spark-vs-storm

  23. 'Spark vs Impala: Which One Is Better for SQL Querying?'. (2021). In Databricks. Retrieved March 10, 2021, from https://databricks.com/glossary/spark-vs-impala

  24. Khandelwal, K. (2020). Introduction to Spark. In Towards Data Science. Retrieved March 10, 2021, from https://towardsdatascience.com/introduction-to-spark-9b2e2a6d9c7e

  25. 'Apache Spark Tutorial'. (2021). In Tutorialspoint. Retrieved March 10, 2021, from https://www.tutorialspoint.com/apache_spark/index.htm

  26. 'Apache Spark: A Comprehensive Guide'. (2021). In DataCamp. Retrieved March 10, 2021, from https://www.datacamp.com/community/tutorials/apache-spark-tutorial-python

  27. 'Spark for Beginners: Learn Spark from Scratch'. (2021). In Analytics Vidhya. Retrieved March 10, 2021, from https://www.analyticsvidhya.com/learning-paths-data-science-business-analytics-business-intelligence-big-data/spark-for-beginners-learn-spark-from-scratch/

  28. 'Spark Documentation'. (2021). In Apache Spark. Retrieved March 10, 2021, from https://spark.apache.org/docs/latest/

  29. 'The Apache Software Foundation'. (2021). In The Apache Software Foundation. Retrieved March 10, 2021, from https://www.apache.org/

  30. Zaharia, M. (2012). An introduction to Apache Spark. In Berkeley AMPLab. Retrieved March 10, 2021, from https://amplab.cs.berkeley.edu/wp-content/uploads/2012/06/SparkTutorial.pdf

Apache Spark References: 30 Essential Resources for Big Data Processing

原文地址: https://www.cveoy.top/t/topic/oYMt 著作权归作者所有。请勿转载和采集!

免费AI点我,无需注册和登录