Senior Data Scientist

🇺🇸 United States - Remote
📊 Data🟣 Senior

Job description

The Chan Zuckerberg Initiative was founded by Priscilla Chan and Mark Zuckerberg in 2015 to help solve some of society’s toughest challenges — from eradicating disease and improving education to addressing the needs of our local communities. Our mission is to build a more inclusive, just, and healthy future for everyone.

The Team

CZI supports the science and technology that will make it possible to help scientists cure, prevent, or manage all diseases by the end of this century. While this may seem like an audacious goal, in the last 100 years, biomedical science has made tremendous strides in understanding biological systems, advancing human health, and treating disease.

Achieving our mission will only be possible if scientists are able to better understand human biology. To that end, we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems — paving the way for new discoveries that will change medicine in the decades that follow:

  • Building an AI-based virtual cell model to predict and understand cellular behavior
  • Developing state-of-the-art imaging systems to observe living cells in action
  • Instrumenting tissues to better understand inflammation, a key driver of many diseases
  • Engineering and harnessing the immune system for early detection, prevention, and treatment of disease

CZI’s work in science includes grantmaking programs, open-source software development, and close collaboration with the Chan Zuckerberg Biohub Network. The CZ Biohub Network includes the San Francisco, Chicago, and New York Biohubs as well as the Chan Zuckerberg Imaging Institute. CZI also collaborates with institutional partners like the Kempner Institute for the Study of Natural & Artificial Intelligence at Harvard University. Join us in accelerating science.

As a Senior Data Scientist, you’ll lead the creation of groundbreaking datasets that power our AI/ML efforts within and across our scientific grand challenges. Working at the intersection of data science, biology, and AI, your work will focus on creating large, AI-ready datasets, spanning single-cell sequencing, immune receptor profiling, and mass spectrometry peptidomics data. You will define data needs, format standards, analysis approaches and quality metrics and build pipelines to ingest, transform, and validate data products that form the foundation of our experiments.

Our Data Ecosystem:

These efforts will form a part of, and interoperate with CZI’s larger data ecosystem. We are generating unprecedented scientific datasets that drive biological innovation:

  • Billions of standardized cells of single-cell transcriptomic data, with a focus on  measuring genetic and environmental perturbations
  • 10s of thousands of donor-matched DNA & RNA samples
  • 10s PBs-scale static and dynamic imaging datasets
  • 100s TBs-scale mass spectrometry datasets
  • Diverse, large multi-modal biological datasets that enable biological bridges across measurement types and facilitate multi-modal model training to define how cells act.

When analysis of a dataset is complete, you will help publish it through public resources like CELLxGENE Discover, the CryoET Portal, and the Virtual Cell Platform, used by tens of thousands of scientists monthly to advance understanding of genetic variants, disease risk, drug toxicities, and therapeutic discovery.

Your Impact

You’ll collaborate with cross-functional teams to lead dataset definition, ingestion, transformation, and delivery for AI modeling and experimental analysis. Success means delivering high-quality, usable datasets that directly address modeling challenges and accelerate scientific progress. Join us in building the data foundation that will transform our understanding of human biology and move us along the path to curing, preventing, and managing all disease.

What You’ll Do

  • Contribute the tools required for a robust data ecosystem: build single cell data ingestion pipelines, select data formats, standards, and database schemas, and write validation tools, QC approaches, and analysis pipelines.
  • Collaborate with Platform Scientists, ML engineers, AI Researchers, and Data Engineers to iteratively evaluate, refine and grow datasets to improve our biological understanding of inflammation.
  • Work closely with Platform Scientists to identify technical variables and devise  approaches to harmonize data across generation sites to enable joint analysis.
  • Discover and define new data generation opportunities, and manage the delivery of those data products to our scientific teams.

What You’ll Bring

  • 10+ years of experience with large scale high throughput biological  data (single cell sequencing, immune receptor profiling, mass spectrometry).
  • Demonstrated ability to deliver multiple large biological data products.
  • Experience with big data: extraction, transport, loading, databases, standardization, validation, QC, and analysis.
  • Experience with processing and orchestration pipelines, such as Argo Workflows, Databricks
  • Strong fundamentals in statistical reasoning and machine learning.
  • Experience with biological data analysis and QC best practices
  • Excellent written and verbal communication skills.
  • Enthusiasm to ramp up on technologies and learn new domains.
  • Experience working in a multidisciplinary environment (scientific platforms, engineering, product, AI Research).

Compensation

The Redwood City, CA base pay range for this role is $190,000 - $261,800. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.

Work Mode

As we grow, we’re excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team’s manager. The exact schedule will be at the hiring manager’s discretion and communicated during the interview process.

Benefits for the Whole You

We’re thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.

  • CZI provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Annual benefit for employees that can be used most meaningfully for them and their families, such as housing, student loan repayment, childcare, commuter costs, or other life needs.
  • CZI Life of Service Gifts are awarded to employees to “live the mission” and support the causes closest to them.
  • Paid time off to volunteer at an organization of your choice.
  • Funding for select family-forming benefits.
  • Relocation support for employees who need assistance moving to the Bay Area
  • And more!

If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.

Explore our work modes, benefits, and interview process at www.chanzuckerberg.com/careers.

#LI-Hybrid

Share this job:
Please let Chan Zuckerberg Initiative know you found this job on Remote First Jobs 🙏

Similar Remote Jobs

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply