Data Engineer

  • $103k-$143k
  • Remote - United States

Remote

Data

Mid-level

Job description

The CDC Foundation helps the Centers for Disease Control and Prevention (CDC) save and improve lives by unleashing the power of collaboration between CDC, philanthropies, corporations, organizations and individuals to protect the health, safety and security of America and the world. The CDC Foundation is the go-to nonprofit authorized by Congress to mobilize philanthropic partners and private-sector resources to support CDC’s critical health protection mission. Since 1995, the CDC Foundation has raised over $1.9 billion and launched more than 1,300 programs impacting a variety of health threats from chronic disease conditions including cardiovascular disease and cancer, to infectious diseases like rotavirus and HIV, to emergency responses, including COVID-19 and Ebola. The CDC Foundation managed hundreds of programs in the United States and in more than 90 countries last year. Visit www.cdcfoundation.org for more information.

Job Highlights

· Location: Remote, must be based in the United States

· Work Schedule: 8am – 5pm Eastern Standard Time.

· Salary Range: 103,500-143,500, plus benefits. Individual salary offers will be based on experience and qualifications unique to each candidate.

· Position Type: Grant funded, limited-term opportunity

· Position End Date: June 30, 2026

Overview

The Data Engineer will play a crucial role in advancing the CDC Foundation’s mission by designing, building, and maintaining data infrastructure for a public health organization. Working within Boston Public Health Commission, IT, Data modernization, the Data Engineer will deliver the architecture needed for data generation, storage, processing, and analysis. The Data Engineer will collaborate with data content experts, analysts, data scientists, data modelers, warehouse architects, IT staff and other organization staff to design and implement proposed solutions and architectures that meet the needs of the public health agency.

The Data Engineer will be hired by the CDC Foundation and assigned to the Boston Public Health Commission, IT, Data modernization. This position is eligible for a fully remote work arrangement for U.S. based candidates.

Responsibilities

· Create and manage the systems and pipelines that enable efficient and reliable flow of data, including ingestion, processing, and storage.

· Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency. Load data into storage systems, data lakes or data warehouses.

· Optimize data pipelines, infrastructure, and workflows for performance and scalability.

· Design, create, test, deploy and maintain data pipelines that deliver curated, value-added data assets such as data lakes and other purpose-built data stores. Ensure data pipelines are optimized, highly reliable, and contain low technical debt.

· Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them.

· Implement security measures to protect sensitive information.

· Collaborate with data scientists, analysts, and other partners to understand their data needs and requirements, and to ensure that the data infrastructure supports the organization’s goals and objectives.

· Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs.

· Implement and maintain ETL and ELT processes to ensure the accuracy, completeness, and consistency of data.

· Design and manage data storage systems, including relational databases, NoSQL databases, and data warehouses.

· Knowledgeable about industry trends, best practices, and emerging technologies in data engineering, and incorporating the trends into the organization’s data infrastructure.

· Provide technical guidance to other staff.

· Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.

Required Qualifications

· Bachelor’s degree in Computer Science, Information Technology, Data Science, or a related field.

· Minimum of five (5) years of experience in building Data Warehouse and/or Data Lake implementations in a product-centric environment.

· Proficiency in programming languages commonly used in data engineering, such as Python, Java, Scala, and SQL Candidate should be able to implement data automations within existing frameworks as opposed to writing one off scripts.

· Experience with big data technologies and frameworks like Hadoop, Spark, Kafka, and Flink.

· Strong understanding of database systems, including relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra).

· Experience regarding engineering best practices such as source control, automated testing, continuous integration and deployment, and peer review.

· Knowledge of data warehousing concepts and tools.

· Experience with cloud computing platforms. Microsoft Azure is a plus.

· Expertise in data modeling, both in ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) processes, and data integration techniques.

· Experience in building Data Warehouse, Data Lake or other Data Platforms in Microsoft Azure platform using Azure Data Factory, SQL Server, Azure Blob Storage (Data Lake Gen2), and Powerbi Visualization tool.

· Familiarity with agile development methodologies, software design patterns, and best practices.

· Strong analytical thinking and problem-solving abilities.

· Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively.

· Flexibility to adapt to evolving project requirements and priorities.

· Outstanding interpersonal and teamwork skills; and the ability to develop productive working relationships with colleagues and partners.

· Experience working in a virtual environment with remote partners and teams

· Experience working in Agile/Scrum environments.

· Proficiency in Microsoft Office.

Preferred Qualifications:

· Knowledge of SAS and R is desirable.

· Hands-on experience with Power BI, Tableau, or other BI tools for data visualization.

· Familiarity with Delta Lake, Medallion Architecture, or Lakehouse architecture.

· Experience with Terraform, Bicep, or ARM templates for Azure infrastructure as code.

· Microsoft certifications such as:

· Azure Data Engineer Associate (DP-203)

· Azure Fundamentals (AZ-900)

Special Notes

This role is involved in a dynamic public health program. As such, roles and responsibilities are subject to change as situations evolve. Roles and responsibilities listed above may be expanded upon or updated to match priorities and needs, once written approval is received by the CDC Foundation in order to best support the public health programming.

All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, religion, sex, national origin, age, mental or physical disabilities, veteran status, and all other characteristics protected by law.

We comply with all applicable laws including E.O. 11246 and the Vietnam Era Readjustment Assistance Act of 1974 governing employment practices and do not discriminate on the basis of any unlawful criteria in accordance with 41 C.F.R. §§ 60-300.5(a)(12) and 60-741.5(a)(7). As a federal government contractor, we take affirmative action on behalf of protected veterans.

The CDC Foundation is a smoke-free environment.

Relocation expenses are not included.

Share this job:
Please let CDC Foundation know you found this job on Remote First Jobs 🙏

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply