Summary
This role is for a Data Warehouse Engineer at Binance, a leading cryptocurrency exchange. It involves building a universal data warehouse system, participating in data governance, designing a data platform that integrates the data lake and data warehouse, and providing business insights through knowledge graph construction. Candidates should have 5+ years of experience in data lake and data warehouse design and development.
Responsibilities
- Build a universal and flexible data warehouse system based on the company's data warehouse specifications and an understanding of the business
- Handle data model design, development, testing, deployment, and monitoring of production data jobs, and resolve complex problems quickly
- Participate in data governance, including building the company's metadata management system and data quality monitoring system
- Design and implement a data platform that integrates the data lake and data warehouse to support real-time data processing and analysis requirements
- Build a knowledge graph and provide in-depth business insights
- Participate in technical team building and learning, and contribute to the team's overall knowledge accumulation and skill improvement
Requirements
- 5+ years of experience in data lake and data warehouse design and development
- Deep understanding of data warehouse modeling and data governance
- Solid knowledge of data warehouse development methodologies, including dimensional modeling and the information factory approach
- Proficient in at least one of Java, Scala, or Python, and in Hive and Spark SQL
- Familiar with OLAP technologies such as Kylin, Impala, Presto, and Druid
- Proficient in big data batch pipeline development
- Familiar with big data components including but not limited to Hadoop, Hive, Spark, Delta Lake, Hudi, Presto, HBase, Kafka, ZooKeeper, Airflow, Elasticsearch, and Redis
- Strong team collaboration attitude and the ability to develop partnerships with other teams and businesses
- Experience in knowledge graph construction and application, and knowledge of graph databases such as Nebula
Preferred Qualifications
- Experience with AWS big data services is a plus
- Extensive experience in real-time data processing and familiarity with stream processing frameworks such as Apache Kafka and Apache Flink
Benefits
Not specified