Previous Job
Data Engineer - High Performance Computing (HPC) and Advanced Analytical Techniques (AAT)
Ref No.: 18-11937
Location: Washington DC, District of Columbia
PROJECT DESCRIPTION: Support for Surveillance Financial Trends and Surveillance Metric Systems for HPC on Advanced Data Analytics Platform and other business initiatives requiring HPC, NLP, AI, and Machine Learning
HPC and AAT Data Engineer Responsibilities:
  • Serve as a staff specialist providing HPC configuration and systems administration support to analytical staff engaged in advanced analytical techniques.
  • Build strong relationships with business staff and advocate for the benefits of HPC, AAT, efficient processing of large text and quantitative datasets, and use of machine learning techniques and toolkits.
  • Build and develop data pipelines and ETL processes with a view towards implementing Client models at scale.
  • Conduct status meetings with Surveillance and ADAS managers and staff
  • Assist where necessary with business process redesign to take advantage of the efficiencies offered by automation
  • Assist in implementing software packages needed to meet FISMA and CMMI standards
  • Research, recommend and write processes, procedures and policies to meet FISMA and CMMI standards
  • Assist in recommending and implementing software needed to effectively leverage HPC and NLP to meet the Surveillance's business needs
  • Maintain knowledge of new or emerging advanced analytical techniques, and partner with internal business owners, technical team members, and senior management to assist and support the design, development, and implementation toolkits implementing these techniques.

HPC and AAT Data Engineer Qualifications
  • Possess at least 3 years' experience configuring and supporting advanced analytical techniques in a multi-node Linux computing environment. (Experience in multi-node cloud computing environments would meet this need.)
  • Has completed BS or MS in Computer Science, Data Science, or related discipline (preferred).
  • Possess experience utilizing Git-based code repositories.
  • Possess experience configuring machine learning toolkits including but not limited to TensorFlow, keras, and scikit-learn.
  • Possess experience configuring natural language processing (NLP) toolkits.
  • Possess at least 3 years' experience in writing shell scripts (e.g., bash) to support configurations of advanced analytical toolkits on multimode environments.
  • Possess at least 3 years' experience supporting and tuning python code in a multi-node computing environment. Experience in R or Java preferred.
  • Possess experience using and/or managing an Anaconda Python environment.
  • Possess experience accessing and submitting queries to SQL-based databases.
  • Possess experience with the Hadoop ecosystem to support high-performance computing environments (preferred).
  • Possess experience working in an Agile or Scrum-based environment (preferred).
  • Possess experience working with Project Management and SDLC artifacts
Possess strong oral and written communication skills