View all jobs


Springfield, VA · Information Technology
HPC Data Engineer
  1. This position supports the Geospatial Services & Solutions business area to provide high-quality, cost-effective solutions to the customer. As part of the GSS Team the Data Engineer's expertise is needed to support a sophisticated enterprise environment.
  2. The Engineer is an active participant in SAFe and Scrum development teams and meetings.

  1. Identify data sources and automate collection processes
  2. Perform preprocessing of structured and unstructured data
  3. Analyze large, complex data sets to identify trends and patterns
  4. Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
  5. Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues
  6. Present information using data visualization techniques
  7. Designs data integrations and data quality frameworks
  8. Designs and evaluates open source and vendor tools for data lineage
  9. Propose solutions and strategies to address Key Intelligence Questions
  10. Collaborate with engineering and product development teams

Required Skills:
  1. 3+ years of experience as a Data Scientist or Data Analysts and Bachelor's Degree in Computer Programming, Science, Engineering or a related technical discipline, or the equivalent combination of education, technical training, or work/military experience
  2. 5+ years of experience in a Linux environment
  3. Direct experience and demonstrated proficiency with multiple programming and scripting languages preferred MPI, OpenMP, Python, Julia, R, Java
  4. 4+ years of SQL experience (No-SQL experience is a plus)
  5. Experience designing, building, and maintaining data processing and management systems
  6. Experience in data mining and data transformations
  7. Experience building robust, secure data pipelines using industry standard tools and techniques (e.g. Kafka, Nifi, Spark, etc
  8. Experience using analytic tools; specifically, JupyterHub and Tableau
  9. Experience working with either a Map Reduce or an MPP system on any size/scale
  10. Experience with retrieving data from, and creating Data Lakes, Date Warehouses, and other data storage constructs
  11. Strong problem solving and troubleshooting skills
  12. Strong communication and interpersonal skills
  13. Must possess excellent time management skills and the drive to work unsupervisedExperience with deploying to on prem/data center infrastructure
    • Active TS clearance required and eligibility to obtain a CI poly

Desired Skills:
    • Prior experience working with or within the Intelligence Community or a military intelligence unit
    • Understanding of access management and security groups (i.e. IAM, S3 bucket, SSH, VPN, etc
Powered by