
Data Engineer with PySpark and Python

SAIC
6 months 4 weeks ago

Description

SAIC is seeking experienced, results-oriented, mission-driven Data Engineers who specialize in data model design, data formatting, and ETL development optimized for efficient storage, access, and computation in support of National Security objectives.

Responsibilities include, but are not limited to:

  • Participate actively in Agile teams responsible for increasing innovation capacity and accelerating the development of data ingestion and data analysis.
  • Coordinate with other tasks to assemble data technologies that control the flow of data from source to value, with the goal of speeding up the process of deriving value and insight.
  • The ideal candidate will have a passion for unlocking the secrets held by a dataset, along with a solid understanding of and experience in developing, automating, and enhancing all parts of the data pipeline, including ingestion, processing, storage, and exposing data for consumption.
  • The Data Engineer also implements data quality tests and focuses on improving inefficient tooling and adopting new transformative technologies while maintaining operational continuity.

Qualifications

Required:

  • Active TS/SCI with Polygraph Clearance is required to be considered
  • Bachelor's Degree in Computer Science, Information Systems, or Engineering (additional years of experience can be substituted for degree)
  • 10+ years of overall professional experience with a Bachelor's Degree, or 5+ years with a Master's Degree
  • 3+ years of hands-on development experience using Python for ETL
  • 3+ years' experience using and ingesting data into SQL and NoSQL database systems
  • Experience programming in Apache Spark, PySpark, and Java
  • ETL experience, including formats such as XML, JSON, and YAML, data normalization, and high-volume data ingestion
  • Familiarity with the NEXIS platform

Desired:

  • Experience with Apache NiFi
  • Experience with Databricks
  • Familiarity with building containerized services (e.g. via Docker)
  • Familiarity with data conditioning
  • Experience developing and maintaining data processing flows.
  • Experience with Amazon Web Services (AWS)
  • Experience with CI/CD pipelines
  • Experience with Agile Methodologies and Kanban Framework
  • Experience using relational databases, including MySQL and/or Oracle, to design database schemas
  • Experience with Linux, REST services, and HTTP

Covid Policy: SAIC does not require COVID-19 vaccinations or boosters. Customer site vaccination requirements must be followed when work is performed at a customer site.

Overview

SAIC® is a premier Fortune 500® technology integrator driving our nation's technology transformation. Our robust portfolio of offerings across the defense, space, civilian, and intelligence markets includes secure high-end solutions in engineering, digital, artificial intelligence and mission solutions. Using our expertise and understanding of existing and emerging technologies, we integrate the best components from our own portfolio and our partner ecosystem to deliver innovative, effective and efficient solutions that are critical to achieving our customers' missions.

We are approximately 24,000 strong; driven by mission, united by purpose, and inspired by opportunities. SAIC is an Equal Opportunity Employer, fostering a culture of diversity, equity, and inclusion, which is core to our values and important to attract and retain exceptional talent. Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $6.9 billion.
