Data Science Software Engineer

Medable is looking for a Software Engineer with strong C++ programming skills and experience in a wide range of programming languages to assist in the development of distributed and high-performance machine learning software systems and algorithms in order to create predictive applications, derive insights from large-scale medical datasets, and to create data analytical tools for our customers and medical specialists.

WHAT YOU WILL DO:

  • Develop C++ libraries/frameworks to support data science infrastructure and applications
  • Develop distributed software systems for training large-scale machine learning models
  • Develop custom machine learning algorithms from lower-level building blocks
  • Create abstractions and easy-to-use interfaces to lower-level machine learning tools and software components
  • Selecting features, building and optimizing classifiers using machine learning techniques
  • Data mining using state-of-the-art methods such as natural language processing
  • Enhancing data collection procedures to include information that is relevant for building analytic systems
  • Processing, cleansing, and verifying the integrity of data used for analysis

EDUCATION AND EXPERIENCE:


  • B.S. or M.Sc. in computer science required
  • Solid programming and computer science skills with the ability to rapidly learn new skills and technologies
  • Prior data science and machine learning experience is desired but not strictly required
  • Experience working with medical datasets a plus

SKILLS REQUIREMENT:

  • High level of expertise in C++. The ideal candidate is someone who is passionate about writing frameworks and libraries, has a solid grasp of modern C++ (C++ 17) and enjoys writing high-performance C++ code with techniques such as template metaprogramming.
  • Moderate familiarity with other languages such as Python, Javascript, and related languages.
  • Strong proficiency with UNIX/Linux and command-line tools. 
  • Proficiency in using query languages such as SQL, or big data and NoSQL systems such as Cassandra, MongoDB, or Hadoop.
  • Excellent communication skills.

STRONGLY DESIRED SKILLS:

  • Prior proficiency in data analytical languages such as R, SAS, Matlab
  • Prior experience with parallel or distributed technologies such as MPI or Node.js
  • Concurrency programming experience with OpenMP, pthreads, or C++ threading libraries, etc
  • Prior experience with machine learning tools such as TensorFlow, Scikit-learn, numpy.
  • Familiarity with web development technologies, front-end or back-end
 
Job type: Remote work is permitted







Want to apply later?

Type your email address below to receive a reminder

Apply to Job

ErrorRequired field
ErrorRequired field
ErrorRequired field
ErrorRequired field
Error
insert_drive_file
ErrorRequired field