Having spearheaded best practices throughout the evolution of data management, from structured data warehousing to big data analytics, Caserta provides innovative enterprise-level solutions that help our clients stay ahead of the technology curve and leverage their data to the fullest extent.
As a Data Engineer at Caserta, you’ll work in small teams to deliver innovative solutions on Amazon Web Services, Azure, and Google Cloud using core cloud data warehouse tools, Spark, event streaming platforms, and other big data technologies. In addition to building the next generation of data platforms, you’ll work with some of the most forward-thinking organizations in data and analytics.
Responsibilities:
• Work as part of a team to develop Cloud Data and Analytics solutions
• Participate in development of cloud data warehouses and business intelligence solutions
• Wrangle heterogeneous data to explore and discover new insights
• Gain hands-on experience with new data platforms and programming languages (e.g. Python, Hive, Spark)
Qualifications:
• Related work experience in Data Engineering or Data Warehousing
• Hands-on experience with leading commercial cloud platforms, including AWS, Azure, and Google Cloud
• Proven experience with data warehousing, data ingestion, and data profiling
• Proficient in SQL
• Strong aptitude for learning new technologies and analytics techniques
• Highly self-motivated and able to work independently as well as in a team environment
• Understanding of agile project approaches and methodologies
• Proficient in a source code control system, such as Git
• Proficient in the Linux shell, including utilities such as SSH
• Experience optimizing the performance of business-critical queries and resolving ETL job issues
• Experience building complex ETL pipelines and migrating them from a source system to Redshift
• Experience extracting and combining data from heterogeneous data sources
• Experience using Apache Spark
• AWS experience
• Strong communication skills, including the ability to explain complex technical issues in non-technical terms
• Knowledge of database structures, theories, principles, and practices
• Experience with S3 data lakes is ideal
• Familiarity with implementing analytics solutions on one or more Hadoop distributions (Cloudera, Hortonworks, MapR, HDInsight, EMR)
• Familiarity with streaming data ingestion
• Proficient in Python
• Consulting experience
• Familiarity with, or a strong desire to learn, quantitative analysis techniques (e.g., predictive modeling, machine learning, segmentation, optimization, clustering, regression)
• Bachelor’s degree in Business Analytics, Computer Science, or a closely related field required
Caserta is an equal opportunity employer. Applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Caserta fosters a collaborative environment for true technologists with a passion for creating innovative data solutions to solve the most complex problems businesses face today.