Principal Software Engineer Data Transformation

Job description

About Cognite

Embark on a transformative journey with Cognite, a global SaaS forerunner in leveraging AI and data to unravel complex business challenges through our cutting-edge offerings including Cognite Atlas AI, an industrial agent workbench, and the Cognite Data Fusion (CDF) platform. We were awarded the 2022 Technology Innovation Leader for Global Digital Industrial Platforms & Cognite was recognized as 2024 Microsoft Energy and Resources Partner of the Year. In the realm of industrial digital transformation, we stand at the forefront, reshaping the future of Oil & Gas, Chemicals, Pharma and other Manufacturing and Energy sectors. Join us in this venture where AI and data meet ingenuity, and together, we forge the path to a smarter, more connected industrial future.

Learn more about Cognite here

Cognite Product Tour 2024

Cognite Product Tour 2023

Data Contextualization Masterclass 2023

Our values

Impact: Cogniters strive to make an impact in all that they do. We are result-oriented, always asking ourselves.

Ownership: Cogniters embrace a culture of ownership. We go beyond our comfort zones to contribute to the greater good, fostering inclusivity and sharing responsibilities for challenges and success.

Relentless: Cogniters are relentless in their pursuit of innovation. We are determined and deliverable (never ruthless or reckless), facing challenges head-on and viewing setbacks as opportunities for growth.

About Cognite & This Role

Cognite is revolutionizing industrial data management through our flagship product, Cognite Data Fusion - a state-of-the-art SaaS platform that transforms how industrial companies leverage their data. We’re seeking an exceptional

Staff Data Platform Engineer who will drive technical strategy, architectural excellence, and cross-organizational impact.

As a Principal Software Engineer, you’ll be a technical leader who shapes platform direction, mentors engineering teams, and tackles the most complex data engineering challenges in industrial technology. This role combines deep technical expertise with strategic thinking and leadership influence.

Strategic Impact & Leadership

  • Technical Strategy & Architecture
  • Define platform architecture for next-generation industrial data processing capabilities serving 100M+ daily requests
  • Drive technology roadmap decisions that impact multiple engineering teams and product areas
  • Lead architectural reviews and establish technical standards across the data platform organization
  • Champion innovation initiatives that differentiate Cognite’s technical capabilities in the industrial data market

Platform Engineering Excellence

  • Architect foundational systems that serve as building blocks for multiple product teams and use cases
  • Design advanced abstractions and frameworks that accelerate development velocity across the organization
  • Own end-to-end platform reliability including SLAs, disaster recovery, and operational excellence
  • Drive platform observability strategy with comprehensive metrics, alerting, and distributed tracing

Advanced Technical Leadership

  • Lead Apache Spark ecosystem contributions - modify core components, contribute to open-source projects, influence roadmaps
  • Architect multi-petabyte streaming systems using Flink, Kafka, and custom processing engines for industrial IoT data
  • Design custom query engines and DSLs optimized for time-series analytics and operational data patterns
  • Pioneer advanced optimization techniques for columnar storage, query planning, and distributed execution

Organizational Impact

  • Mentor Staff and Senior engineers across multiple teams, developing technical leadership capabilities
  • Drive cross-functional initiatives involving ML platform, product engineering, and infrastructure teams
  • Establish engineering culture around technical excellence, innovation, and customer obsession
  • Represent Cognite externally through conference speaking, open- source leadership, and technical thought leadership

Exceptional Candidate Profile

  • Technical Leadership Mastery (8+ years)

  • Distributed Systems Architecture - Advanced Apache Spark expertise - contributed to Spark core, Catalyst optimizer, or execution engine with merged commits - Large-scale streaming architecture - designed and operated Flink/Kafka systems processing 1M+ events/second with complex statemanagement - Custom execution engine development - built domain-specific query engines or processing frameworks from scratch - Performance engineering excellence - consistently delivered 10x+ performance improvements through algorithmic and architectural innovations

  • Platform Engineering Leadership - Multi-tenant platform design - architected platforms serving 50+ engineering teams with sophisticated resource isolation and management - Advanced data lake architecture -designed exabyte-scale storage systems with intelligent tiering, compaction, and query optimization - Service mesh and infrastructure - implemented sophisticated microservices architectures with advanced networking, security, and observability

  • Data Engineering Innovation - OLAP system expertise - deep experience with ClickHouse, Pinot, Druid internals including cluster management and query optimization - Advanced stream processing - implemented complex event processing, windowing, and stateful computations for industrial use cases - Data modeling excellence - designed sophisticated schemas for dimensional modeling, event sourcing, and real-time analytics

Technical Leadership Excellence

  • Strategic Technical Decision Making - Technology evaluation leadership - led organization-wide technology adoption decisions with comprehensive POCs and risk analysis - Architecture governance - established and maintained architectural principles, standards, and review processes - Technical debt management - strategically balanced feature velocity, system reliability, and long-term maintainability

Open Source & Community Leadership -

  • Open-source contributions to major Apache projects in the data space (e.g. Apache Spark or Kafka) is a big plus - Significant open source contributions - maintainer or core contributor to major Apache projects (Spark, Flink, Kafka, Airflow) - Technical thought leadership - regular speaker at major conferences (Strata, Spark Summit, QCon) with recognized expertise - Community building -organized engineering communities, contributed to standards bodies, or led technical advisory roles Cross-Functional Collaboration - Product partnership excellence - translated ambiguous business requirements into robust technical architectures - Engineering leadership - mentored and developed multiple Senior and Staff engineers with measurable career progression - Executive communication - effectively communicated complex technical concepts to C-level stakeholders and board members

Startup Excellence & Scale

  • High-Impact Delivery - Rapid scaling experience - led technical initiatives during hypergrowth phases (10x team/infrastructure scaling) - Customer- facing technical leadership - directly engaged with enterprise customers on complex technical integrations - Business impact ownership - drove technical initiatives that resulted in measurable business outcomes (cost reduction, revenue enablement)
  • Innovation & Ambiguity Navigation - Greenfield platform development architected major platform components from concept to production at scale - Technical risk management - successfully navigated high-risk technical decisions with incomplete information - Competitive differentiation -developed technical capabilities that created sustainable competitive advantages

Technical Environment & Challenges

  • Advanced Technical Stack
  • Core Languages: Kotlin, Scala, Java (advanced JVM optimization)
  • Big Data Ecosystem: Apache Spark (internals), Apache Flink, Apache Kafka, Apache Airflow
  • Advanced Storage: ClickHouse, PostgreSQL, Elasticsearch, S3-compatible systems with custom optimization
  • Infrastructure: Kubernetes (operator development), Terraform,Advanced observability stack

Cutting-Edge Technologies You’ll Drive

  • Table Format Innovation: Apache Iceberg, Delta Lake internals andoptimization
  • Query Engine Development: Trino/Presto, Apache Pinot, custom engine development
  • Advanced Streaming: Complex event processing, exactly-once semantics, advanced windowing
  • ML/AI Integration: Feature stores, model serving infrastructure, MLOps platform integration

Unique Technical Challenges

  • Industrial IoT at Scale: Processing sensor data from millions of industrial assets in real-time
  • Complex Time-Series Analytics: Advanced temporal queries, anomaly detection, predictive analytics
  • Multi-Cloud Optimization: Cost and performance optimization across AWS, Azure, GCP
  • Enterprise Integration: Secure, scalable integration with legacy industrial systems

Join the global Cognite community! ๐ŸŒ

- Join an organization of 70 different nationalities ๐ŸŒ with Diversity, Equality and Inclusion (DEI) in focus ๐Ÿค

- Office location Rathi Legacy (Rohan Tech Park ) Hoodi (Bengaluru)

- A highly modern and fun working environment with sublime culture across the organization, follow us on Instagram @cognitedataย ๐Ÿ“ท to know more

- Flat structure with direct access to decision-makers, with minimal amount of bureaucracy

- Opportunity to work with and learn from some of the best people on some of the most ambitious projects found anywhere, across industries

- Join our HUB ๐Ÿ—ฃ๏ธ to be part of the conversation directly with Cogniters and our partners.

- Hybrid work environment globally

Why choose Cognite? ๐Ÿ† ๐Ÿš€

Join us in making a real and lasting impact in one of the most exciting and fastest-growing new software companies in the world. We have repeatedly demonstrated that digital transformation, when anchored on strong DataOps, drives business value and sustainabilityfor clients and allows front-line workers, as well as domain experts, to make better decisions every single day. We were recognized as one of CNBC’s top global enterprise technology startups powering digital transformation! And just recently, Frost & Sullivan named Cognite a Technology Innovation Leader! ๐Ÿฅ‡ Most recently Cognite Data Fusionยฎ Achieved Industry First DNV Compliance for Digital Twins ๐Ÿฅ‡

Apply today!

If you’re excited about the opportunity to work at Cognite and make a difference in the tech industry, we encourage you to apply today! We welcome candidates of all backgrounds and identities to join our team.

We encourage you to follow us on Cognite LinkedIn; we post all our openings there.

Share this job:
Please let Cognite know you found this job on Remote First Jobs ๐Ÿ™

Similar Remote Jobs

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service ๐Ÿ™

Apply