This position supports the Geospatial Services & Solutions business area to provide high-quality, cost-effective solutions to the customer. As part of the GSS Team the Data Engineer's expertise is needed to support a sophisticated enterprise environment.
The Engineer is an active participant in SAFe and Scrum development teams and meetings.
SPECIFIC DUTIES AND RESPONSIBILITIES:
Identify data sources and automate collection processes
Perform preprocessing of structured and unstructured data
Analyze large, complex data sets to identify trends and patterns
Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues
Present information using data visualization techniques
Designs data integrations and data quality frameworks
Designs and evaluates open source and vendor tools for data lineage
Propose solutions and strategies to address Key Intelligence Questions
Collaborate with engineering and product development teams
Required Skills:
3+ years of experience as a Data Scientist or Data Analysts and Bachelor's Degree in Computer Programming, Science, Engineering or a related technical discipline, or the equivalent combination of education, technical training, or work/military experience
5+ years of experience in a Linux environment
Direct experience and demonstrated proficiency with multiple programming and scripting languages preferred MPI, OpenMP, Python, Julia, R, Java
4+ years of SQL experience (No-SQL experience is a plus)
Experience designing, building, and maintaining data processing and management systems
Experience in data mining and data transformations
Experience building robust, secure data pipelines using industry standard tools and techniques (e.g. Kafka, Nifi, Spark, etc
Experience using analytic tools; specifically, JupyterHub and Tableau
Experience working with either a Map Reduce or an MPP system on any size/scale
Experience with retrieving data from, and creating Data Lakes, Date Warehouses, and other data storage constructs
Strong problem solving and troubleshooting skills
Strong communication and interpersonal skills
Must possess excellent time management skills and the drive to work unsupervisedExperience with deploying to on prem/data center infrastructure
Active TS clearance required and eligibility to obtain a CI poly
Desired Skills:
Prior experience working with or within the Intelligence Community or a military intelligence unit
Understanding of access management and security groups (i.e. IAM, S3 bucket, SSH, VPN, etc