50个大数据平台说明及数据源地址
-
Hadoop - Apache Hadoop is an open-source software framework used for distributed storage and processing of large datasets.
-
Spark - Apache Spark is an open-source big data processing framework that provides fast and efficient data processing.
-
Cassandra - Apache Cassandra is a distributed NoSQL database management system designed to handle large amounts of data across multiple servers.
-
MongoDB - MongoDB is a NoSQL document-oriented database that is designed to handle large amounts of data.
-
HBase - Apache HBase is a distributed, column-oriented database that is built on top of Hadoop.
-
Amazon EMR - Amazon Elastic MapReduce (EMR) is a web service that provides a managed Hadoop framework in the cloud.
-
Google BigQuery - Google BigQuery is a cloud-based data warehousing and analytics platform that enables users to analyze large datasets using SQL-like queries.
-
Microsoft Azure HDInsight - Microsoft Azure HDInsight is a cloud-based service that provides a managed Hadoop framework in the cloud.
-
Cloudera - Cloudera is a big data platform that provides a suite of tools for managing and analyzing large datasets.
-
Hortonworks - Hortonworks is a big data platform that provides a suite of tools for managing and analyzing large datasets.
-
MapR - MapR is a big data platform that provides a suite of tools for managing and analyzing large datasets.
-
IBM BigInsights - IBM BigInsights is a big data platform that provides a suite of tools for managing and analyzing large datasets.
-
Apache Storm - Apache Storm is a distributed real-time big data processing system.
-
Apache Flink - Apache Flink is a distributed stream processing framework for high-throughput, low-latency, and fault-tolerant data processing.
-
Apache Beam - Apache Beam is a unified programming model for batch and streaming data processing.
-
Apache NiFi - Apache NiFi is a data flow management system for automating the flow of data between systems.
-
Apache Kafka - Apache Kafka is a distributed streaming platform that enables users to publish and subscribe to streams of records.
-
Elasticsearch - Elasticsearch is a distributed, open-source search and analytics engine that is designed to handle large amounts of data.
-
Kibana - Kibana is an open-source data visualization and exploration platform that is designed to work with Elasticsearch.
-
Logstash - Logstash is an open-source data processing pipeline that is designed to work with Elasticsearch.
-
Splunk - Splunk is a software platform that enables users to search, analyze, and visualize large amounts of data.
-
Tableau - Tableau is a data visualization and business intelligence software that enables users to create interactive dashboards and reports.
-
QlikView - QlikView is a business intelligence software that enables users to create interactive dashboards and reports.
-
SAP HANA - SAP HANA is an in-memory database and application platform that is designed to handle large amounts of data.
-
Teradata - Teradata is a data warehousing and analytics platform that is designed to handle large amounts of data.
-
Oracle Big Data - Oracle Big Data is a suite of tools for managing and analyzing large datasets.
-
Talend - Talend is an open-source data integration platform that enables users to extract, transform, and load data from various sources.
-
Informatica - Informatica is a data integration platform that enables users to extract, transform, and load data from various sources.
-
Alteryx - Alteryx is a self-service data analytics platform that enables users to prepare, blend, and analyze data.
-
RapidMiner - RapidMiner is a data science platform that enables users to build predictive models and perform data analysis.
-
DataRobot - DataRobot is an automated machine learning platform that enables users to build predictive models without the need for coding.
-
Databricks - Databricks is a cloud-based big data processing platform that provides a unified analytics platform for data engineering, machine learning, and analytics.
-
Snowflake - Snowflake is a cloud-based data warehousing platform that provides a scalable and secure data storage and analytics solution.
-
Redshift - Amazon Redshift is a cloud-based data warehousing platform that provides a scalable and secure data storage and analytics solution.
-
Google Cloud Dataflow - Google Cloud Dataflow is a cloud-based data processing service that enables users to create data pipelines for batch and streaming data processing.
-
Apache Apex - Apache Apex is a distributed stream processing platform for high-throughput, low-latency, and fault-tolerant data processing.
-
Apache Kylin - Apache Kylin is a distributed analytical data warehouse that is designed to handle large amounts of data.
-
Apache Druid - Apache Druid is a distributed, column-oriented database that is designed for real-time analytics.
-
Presto - Presto is a distributed SQL query engine that is designed for fast and efficient data processing.
-
Apache Calcite - Apache Calcite is a framework for building SQL query engines that can be used with various data sources.
-
Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data processing.
-
Apache Arrow Flight - Apache Arrow Flight is a high-performance data transport layer for distributed systems.
-
Apache Arrow Flight RPC - Apache Arrow Flight RPC is a remote procedure call (RPC) framework that is built on top of Apache Arrow Flight.
-
Apache Arrow C++ - Apache Arrow C++ is a C++ library for in-memory data processing that is built on top of Apache Arrow.
-
Apache Arrow Python - Apache Arrow Python is a Python library for in-memory data processing that is built on top of Apache Arrow.
-
Apache Arrow Java - Apache Arrow Java is a Java library for in-memory data processing that is built on top of Apache Arrow.
-
Apache Arrow Rust - Apache Arrow Rust is a Rust library for in-memory data processing that is built on top of Apache Arrow.
-
Apache Arrow Golang - Apache Arrow Golang is a Golang library for in-memory data processing that is built on top of Apache Arrow.
-
Apache Arrow JavaScript - Apache Arrow JavaScript is a JavaScript library for in-memory data processing that is built on top of Apache Arrow.
-
Apache Arrow Ruby - Apache Arrow Ruby is a Ruby library for in-memory data processing that is built on top of Apache Arrow
原文地址: https://www.cveoy.top/t/topic/fqZC 著作权归作者所有。请勿转载和采集!