Design, build, and manage complex analytics data models in Hive/Hadoop for GTM Analytics team across all customer journey from Acquisition, Engagement, and Retention. The analytics data marts will be used by data analysts in GTM Analytics and other team to do deep dive analysis, build analytics dashboard, or other data science project.
Design, build, deploy, and maintain new data models ETL pipeline with SQL query, Python, Oozie, and other script language and create/maintain workflow using Oozie.
Ensure overall data quality.
Querying and manipulating large data sets for analytical purposes using SQL-like languages (Hive is strongly preferred)
- Experience with Hadoop/big data environments to synthesize and analyze data.
- professional experience in the data warehouse space
- Good attention to detail and ability to QA multiple data sources
- Experience working on building scalable ETL pipelines, data warehousing and schema modeling
- Experience working with Oozie Workflow
- Experience with script language such as Python
- 2 - 3 years of relevant experience