The role involves working with a massively parallel enterprise platform, built on Free and Open Source Software products including NiFi, Hadoop and Accumulo, to enable execution of data-intensive analytics on a managed infrastructure. The position is on the Analytics Engine team. We are looking for a self-motivated individual who proactively completes tasks with strong attention to detail. The candidate must be able to work both independently and as part of a team, and will be exposed to a variety of technologies depending on customer requirements. At a minimum, the candidate should be skilled in concurrent programming and have experience working with large Hadoop-based clusters.
Required Experience
- Shall have at least eight (8) years of experience in software development/engineering, including requirements analysis, software development, installation, integration, evaluation, enhancement, maintenance, testing, and problem diagnosis/resolution.
- Shall have demonstrated experience working with open source (NoSQL) products that support highly distributed, massively parallel computation needs, such as HBase, Cloudbase/Accumulo, Bigtable, etc.
- Shall have demonstrated work experience with the MapReduce programming model and technologies such as Hadoop, Hive, Pig, etc.
- Shall have demonstrated work experience with the Hadoop Distributed File System (HDFS).
- Shall have demonstrated work experience with serialization formats such as JSON and/or BSON.
- Shall have demonstrated work experience in the requirements analysis and design of at least one Object Oriented system.
- Shall have demonstrated work experience developing solutions integrating and extending FOSS/COTS products.
- Shall have at least three (3) years of experience in software integration and software testing, to include developing and implementing test plans and test scripts.
- Shall have demonstrated technical writing skills and shall have generated technical documents in support of a software development project.
- Experience developing and deploying: data-driven analytics; event-driven analytics; sets of analytics orchestrated through rules engines.
- In addition, the candidate will have demonstrated work experience in at least four (4) of the desired characteristics.
- Hadoop/Cloud Developer Certification or comparable Cloud System/Service Certification. Six (6) months of experience administering or implementing Cloud technology will be accepted as a substitute for certification.
- Shall have at least six (6) years of experience developing software with high level languages such as Java, C, C++.
- Shall have demonstrated work experience developing RESTful services.
- Shall have at least five (5) years of experience developing software for Windows (2000, 2003, XP, Vista) or UNIX/Linux (Red Hat versions 3-5) operating systems.
Desired Experience
- Experience developing and deploying: analytics that include foreign language processing; analytic processes that incorporate/integrate multi-media technologies, including speech, text, image and video exploitation; analytics that function on massive data sets, for example more than a billion rows or larger than 10 Petabytes; analytics that employ semantic relationships (i.e., inference engines) between structured and unstructured data sets; analytics that identify latent patterns between elements of massive data sets, for example more than a billion rows or larger than 10 Petabytes; analytics that employ techniques commonly associated with Artificial Intelligence, for example genetic algorithms.
- Experience designing and developing automated analytic software, techniques, and algorithms.
- Experience with taxonomy construction for analytic disciplines, knowledge areas and skills.
- Experience developing and deploying analytics that discover and exploit social networks.
- Experience documenting ontologies, data models, schemas, formats, data element dictionaries, software application program interfaces and other technical specifications.
- Experience developing and deploying analytics within a heterogeneous schema environment.
- Experience with linguistics (grammar, morphology, concepts).
- Understanding of Big-Data Cloud Scalability (Amazon, Google, Facebook).
Note: Must have an active TS/SCI clearance with polygraph.