Senior Data Engineer

Manifold is a full-service AI consulting company offering a complete range of AI engineering services, including machine learning, data science, data engineering, DevOps, cloud, and edge.


We have a proven ability to design, build, deploy, and manage complex data applications. Manifold is trusted by CTOs, CIOs, and GMs at Global 500 and high-growth companies. Our work spans industries such as consumer electronics, industrials, wireless, online commerce, digital health, and more.
 
Manifold's experienced engineers have a track record of innovation at organizations like Google, Qualcomm, MIT, and successful venture-backed startups. Our Advisory Board includes leading researchers in deep learning at Stanford and Harvard. 
 
At Manifold, you will have the opportunity to work with different organizations, understand their data environments, and solve the challenge of preparing an organization's data for analysis and machine learning.

WHAT YOU'LL DO
  • Work with our customers to understand their data sources and build distributed ETL systems across a range of platforms and technologies
  • Serve as a data engineering expert, both internally at Manifold and with our customers
  • Determine the best way to bring together disparate data and combine different data sources with both batch and real-time data flows
  • Review data and perform statistical analyses to ensure the data is of sufficient quality and ready for machine learning
  • Design and architect data pipeline solutions and work with other data engineers to drive their implementation
  • Explore and prototype the latest data processing technologies and libraries to determine how they can be used for our customers and prospects.
  • Architect internal data pipelines and reference systems to reduce the time to deploy new solutions for our customers
  • Work from our Oakland, CA or Boston, MA offices, with opportunities to travel to the other office and our clients. 

WHO YOU ARE
You are someone who:

  • Is detail-oriented and understands the importance of data syntax and its semantic meaning
  • Possesses excellent interpersonal and team-building skills
  • Considers data-driven decision making core to your lifestyle
  • Understands how to best apply technologies given the business needs of your customers
  • Enjoys presenting your findings to internal team members and your customers
  • Exhibits a positive, people-oriented, and energetic attitude 

In your past work, you probably have:

  • Designed and built production ETL, data pipeline, and entity resolution systems for at least 5 years
  • Worked with data warehousing tools such as Hive, Spark, and Amazon Redshift, and used queuing technologies such as Kafka, RabbitMQ, or Amazon SQS
  • Deployed both SQL and NoSQL database technologies such as PostgreSQL, MySQL, Cassandra, CouchDB, and MongoDB
  • Gained experience with parallel processing and distributed computing to handle high volumes of data very quickly
  • Developed a deep understanding of the operational aspects of a distributed compute environment
  • Worked with data science and analytics groups as customers of your software
  • Worked in an agile software development environment and learned how to ship iterative features to production

 
WHAT WE OFFER
AUTONOMY
Doing our most creative work without the friction of bureaucratic decision-making and inefficient meetings.

GROWTH
Growing professionally and personally by working with other exceptional people in tackling challenging problems.

PURPOSE
Making the kind of impact on other organizations and people that makes every day fulfilling.

CULTURE
Optimistic, curious, humble, generous, transparent, and in control of our own destiny.

TOP QUARTILE COMPENSATION
We benchmark our compensation framework to market every year using the same data that large technology companies use.
 