# BigData **Repository Path**: XuKaGit/big-data ## Basic Information - **Project Name**: BigData - **Description**: No description available - **Primary Language**: Unknown - **License**: MulanPSL-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 1 - **Forks**: 0 - **Created**: 2025-04-06 - **Last Updated**: 2025-09-05 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README ## My BigData Learning Notes 大数据的核心工作: 数据存储, 数据计算, 数据传输 ### Concepts - [数据湖,数据仓库和 Lakehouse](./Concepts/DataLakeWarehouseLakehouse.md) ### EnvBuild [EnvBuild](./EnvBuild.md) --- **大数据平台的相关环境搭建** - Linux Cloud Servers - JDK - Hadoop - Hive - Anaconda - Spark ### MySQL ### Hadoop & Hive - [HadoopIntro.md](./HadoopHive/HadoopIntro.md) --- **Hadoop理论**

- [HiveIntro.md](./HadoopHive/HiveIntro.md) --- **Hive理论**

- [HadoopPrac.md](./HadoopHive/HadoopPrac.md) --- **Hadoop实操**

- [HivePrac.md](./HadoopHive/HivePrac.md) --- **Hive实操** ### Spark - [SparkIntro.md](./Spark/SparkIntro.md) --- **Spark理论**

- **SparkPrac文件夹** --- **Spark实操** ### Flink & Kafka & Flume ### Cloud - 在 **Databricks / Azure Databricks** 上进行 **Spark与机器学习** 操作