Systems Engineer (TS/SCI w/ FSP) - 2057682
Job Description
Our client is seeking a Systems Engineer to support a federal agency in McLean, VA.
1. (Mandatory) Demonstrated experience utilizing big data processing technologies such as Spark, PySpark, and Python.
2. (Mandatory) Demonstrated experience with data mapping, extraction, transformation, and loading.
3. (Mandatory) Demonstrated experience building analytic reports in tools such as CloudWatch and Kibana.
4. (Mandatory) Demonstrated experience using AWS Step Functions to coordinate ETL pipelines.
5. (Mandatory) Demonstrated experience processing and converting OS and data logs into reports and metrics dashboards.
6. (Mandatory) Demonstrated experience with Regular Expressions (RegEx).
7. (Mandatory) Demonstrated experience with SQL, MySQL, and PostgreSQL.
8. (Mandatory) Demonstrated experience processing data file types such as XML and JSON.
9. (Mandatory) Demonstrated experience with IDEs and data modeling through notebooks and Visual Studio.
10. (Mandatory) Demonstrated experience extracting, transforming, and loading (ETL) data from disparate structured and unstructured formats into enriched, query-friendly structured data in indexed files.
11. (Mandatory) Demonstrated experience performing extensive data review and data quality analysis.
12. (Mandatory) Demonstrated experience developing ETL design documentation including source and target mapping and data dictionary information.
13. (Mandatory) Demonstrated experience interfacing with customers and integration partners to gather and clarify detailed objectives.
14. (Mandatory) Demonstrated experience supporting Agile development by contributing to tasking definition, scope, and review.
15. (Desired) Demonstrated experience deploying capabilities on the Databricks unified analytics platform.
16. (Desired) Demonstrated experience in optimizing Databricks Delta tables for query, merge, and stream operations.
17. (Desired) Demonstrated experience with CI/CD using Jenkins.
18. (Desired) Demonstrated experience in joining multiple complex data sets using Spark.
19. (Desired) Demonstrated experience in tuning Spark streaming and batch jobs for cluster utilization and speed.
20. (Desired) Demonstrated experience in deploying complex, notebook-based pipelines.
21. (Desired) Demonstrated experience in Python data analysis libraries such as Pandas.
22. (Desired) Demonstrated experience utilizing cloud services such as Lambda, SNS/SQS, or EC2.
23. (Desired) Demonstrated experience with DevOps tools, including CloudWatch, Lambda, SQS, DynamoDB, and RDS.
24. (Desired) Demonstrated experience working with the Elastic Stack: Elasticsearch, Logstash, and Kibana (ELK).
Must be a U.S. citizen with a U.S. Government clearance (Intel with Polygraph).
NOTE: Must have an active TS/SCI with polygraph. No sponsorships or upgrades are available. Submissions that do not meet this requirement will not be considered.
H1-B holders will not be considered.