Job description
Join phData, a dynamic and innovative leader in the modern data stack. We partner with major cloud data platforms like Snowflake, AWS, Azure, GCP, Fivetran, Pinecone, Glean, and dbt to deliver cutting-edge services and solutions.We’re committed to helping global enterprises overcome their toughest data challenges.
phData is a remote-first global company with employees based in the United States, Latin America, and India. We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership, and trust. Even though we’re growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.
- 6x Snowflake Partner of the Year (2020, 2021, 2022, 2023, 2024, 2025)
- Fivetran, dbt, Atlation, and AWS Partner of the Year
- #1 Partner in Snowflake Advanced Certifications
- 600+ Expert Cloud Certifications (Sigma, AWS, Azure, Dataiku, etc)
Recognized as an award-winning workplace in the US, India, and LATAM
Senior Data Engineer - Internal Platform
At phData, the Platform team builds and operates our internal Intelligence Platform, powering our operations, sales, and delivery teams with data, analytics, and AI‑driven insights. We provide the core data and technology foundation that helps phData run efficiently and make better decisions every day.
We’re looking for a Senior Data Engineer to lead the design, implementation, and operations of our internal data platform, enabling faster, smarter decision‑making across the company. This is a unique opportunity to drive both technology and business outcomes on the cutting edge of AI, working on systems that directly impact how phData runs and grows.
What You’ll Do:
- Lead the internal data platform on Snowflake and AWS, including data modeling, orchestration, and governance for key internal use cases (operations, sales, delivery, finance, and AI/ML).
- Design, build, and maintain robust data pipelines using SQL, dbt, and modern ETL/ELT tooling to serve analytics, operational reporting, and AI workloads.
- Productionize and operate AI‑powered features and workflows, leveraging AI‑assisted development practices and tools such as GitHub Copilot.
- Partner with stakeholders across Operations, Sales, Delivery, and Finance to understand data needs and translate them into reliable, well‑documented, and maintainable technical solutions.
- Ensure reliability, performance, and security of the platform, including monitoring, observability, and adherence to best practices for cloud data platforms.
- Leverage infrastructure‑as‑code to provision and manage data platform resources in a repeatable and auditable way.
- Contribute to documentation and standards, including data models, roadmaps, runbooks, diagrams, and data cataloging to drive consistency and reusability across the platform.
Required Experience:
- 5+ years as a hands-on Data Engineer and/or Software Engineer, designing and implementing data solutions
- Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration.
- Ability to multitask, prioritize, and work across multiple projects at once.
- Deep experience in cloud data platforms, including Snowflake and AWS
- Strong SQL and dbt skills for modeling, debugging, and optimizing transformations.
- Programming expertise in Python and/or Java, includingexperience with the software development life cycle, including unit and integration testing.
- Data ingestion experience, including custom API integrations utilizing Fivetran and reverse ETL with tools like Census.
- Basic working knowledge of BI tools like Sigma Computing.
- Experience with AI-assisted development workflows and best practices, including GitHub Copilot.
- Knowledge of Infrastructure as Code tools such as Terraform, CloudFormation, or phData’s provision tool.
- Excellent written and verbal communication skills and experience, including creating and delivering detailed presentations.
- Detailed solution documentation, including roadmaps, diagrams, and data cataloging.
- 4-year Bachelor’s degree in Computer Science or a related field
Prefer any of the following:
- Production experience in other data platforms: Azure, GCP, Hadoop, Databricks
- Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra, or other NoSQL storage systems
- Data integration technologies: Spark, Kafka, event/streaming, Fivetran, AWS Data Migration Services, Azure DataFactory, Google DataProc, or other data integration technologies
- Multiple data sources, including queues, relational databases, files, search indexes, and APIs.
- Complete software development lifecycle experience, including design, documentation, implementation, testing, and deployment
- Automated data transformation and data curation: Spark, Spark streaming, automated pipelines
Why phData
- Work on a high‑impact internal platform that directly shapes how phData operates, sells, and delivers for its customers.
- Build on a modern cloud data stack (Snowflake, AWS, Fivetran, dbt, Sigma Computing, etc.) with strong support for experimentation and improvement.
- Collaborate with experienced data, analytics, and AI practitioners, and help define how we run phData on data and AI.
- Enjoy a remote‑friendly culture with a distributed team across the globe.
If you’re excited about building and operating data platforms that power real decisions—and you’re interested in working at the intersection of data, cloud, and AI—we encourage you to apply, even if you don’t meet every qualification listed above.
phData celebrates diversity and is committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at phData. We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at People Operations.








