Staff Data Engineer

at Thoughtful AI
🇺🇸 United States - Remote
📊 Data · 🔵 Mid-level

Job description

Join Our Mission to Revolutionize Healthcare

Thoughtful is pioneering a new approach to automation for all healthcare providers! Our AI-powered Revenue Cycle Automation platform enables the healthcare industry to automate and improve its core business operations.

We’re looking for Staff Data Engineers to help scale and strengthen our data platform.

Our data stack today consists of Aurora RDS, AWS Glue, Apache Iceberg, S3 (Parquet), Spark and Athena - supporting a range of use cases from operational reporting to downstream services. We’re looking to grow the team with engineers who can help improve performance, increase reliability, and expand the platform’s capabilities as our data volume and complexity continue to grow.

You’ll work closely with other engineers to evolve our existing pipelines, improve observability and data quality, and enable faster, more flexible access to data across the company. The platform is deployed on AWS using OpenTofu, and we’re looking for engineers who bring strong cloud infrastructure fundamentals alongside deep experience in data engineering.

Your Role:

  • Build: Develop and maintain data pipelines and transformations across the stack, from ingesting transactional data into the data lakehouse to refining it through the layers of the medallion architecture.
  • Optimize: Tune performance, storage layout, and cost-efficiency across our data storage and query engines.
  • Extend: Help design and implement new data ingestion patterns and improve platform observability and reliability.
  • Collaborate: Partner with engineering, product, and operations teams to deliver well-structured, trustworthy data for diverse use cases.
  • Contribute: Help establish and evolve best practices for our data infrastructure, from pipeline design to OpenTofu-managed resource provisioning.
  • Secure: Help design and implement a data governance strategy to secure our data lakehouse.

Your Qualifications:

  • 8-10+ years of experience building and maintaining data pipelines in production environments
  • Strong knowledge of the data lakehouse ecosystem, with an emphasis on AWS data services - particularly Glue, S3, Athena/Trino/PrestoDB, and Aurora
  • Proficiency in Python, Spark and Athena/Trino/PrestoDB for data transformation and orchestration
  • Experience managing infrastructure with OpenTofu/Terraform or other Infrastructure-as-Code tools
  • Solid understanding of data modeling, partitioning strategies, schema evolution, and performance tuning
  • Comfort working with cloud-native data pipelines and batch processing (streaming experience is a plus but not required)

What Sets You Apart:

  • Systems thinker - you understand the tradeoffs in data architecture and design for long-term stability and clarity
  • Outcome-driven - you focus on building useful, maintainable systems that serve real business needs
  • Strong collaborator - you’re comfortable working across teams and surfacing data requirements early
  • Practical and hands-on - you can dive into logs, schemas, and IAM policies when needed
  • Thoughtful contributor - you’re committed to improving code quality, developer experience, and documentation across the board

Why Thoughtful?

  • Competitive compensation
  • Equity participation: Employee Stock Options.
  • Health benefits: Comprehensive medical, dental, and vision insurance.
  • Time off: Generous leave policies and paid company holidays.

California Salary Range

$190,000—$250,000 USD
