Principal Engineer GenAI Big Data

Job description

Role - Principal Engineer

Location - Hybrid for Islamabad

Remote for other cities

Role Summary:

Lead the design, architecture, and development of next-generation intelligent systems at the intersection of Generative AI, Big Data, and Cloud. Define technical roadmaps, build production-grade multi-agent platforms, and drive innovation across distributed systems, lakehouse architectures, and LLMOps. Operate as a hands-on technical leader and mentor without formal management responsibilities.

Key Responsibilities

  • Design and deliver production-grade RAG systems with embedding refresh strategies, vector DB synchronization, and hybrid search.
  • Architect and implement AI agent orchestration frameworks (ReAct, multi-agent coordination, persistent state, error recovery, observability).
  • Build scalable event-driven architectures with idempotency, exactly-once/at-least-once semantics, poison message handling, and backpressure management.
  • Contribute to Lakehouse data architectures (Delta Lake, Iceberg, Hudi), addressing schema evolution, compaction, and ACID transactions on object storage.
  • Develop high-performance ML/LLM code for real-time pipelines, extending frameworks when required.
  • Collaborate with data scientists and platform engineers to accelerate model experimentation, validation, and deployment.
  • Define and implement LLMOps strategies including prompt versioning, token cost tracking, evaluation, and personalization.
  • Drive architectural vision through design/code reviews, mentorship, and thought leadership.
  • Innovate in Generative AI, distributed systems, and intelligent platforms from concept through delivery.

Must-Have Skills & Tools

  • 3+ years of building and deploying ML/LLM solutions in production (RAG, LLM fine-tuning, embeddings).
  • Hands-on expertise with RAG system design: document chunking, vector DB synchronization, retrieval evaluation.
  • Deep knowledge of indexing algorithms (HNSW, IVF, LSH) and hybrid search.
  • Proven experience with agent orchestration frameworks (LangGraph, AutoGen, CrewAI, or custom).
  • Strong background in distributed systems and event-driven architectures (Kafka, Debezium, CDC, DLQs).
  • Cloud-native development expertise (AWS).
  • Strong programming skills (Python).

Nice-to-Have Skills

  • Experience with Graph ML and Graph RAG (ontologies, semantic layers, GNNs).
  • Familiarity with Big Data tools (Spark, Flink, PySpark, Glue, Druid).
  • Hands-on work with Lakehouse technologies (Delta, Iceberg, Hudi).
  • Designing evaluation frameworks for LLMs and multi-agent systems.
  • Experience handling unstructured data pipelines (PDFs, tables, images) and real-time personalization.

Soft Skills / Traits

  • Strong problem-solving in complex and ambiguous scenarios.
  • Excellent collaboration across data, AI, and engineering teams.
  • Ability to mentor peers and influence architectural decisions.
  • Clear technical communication skills for design reviews and cross-team discussions.
Share this job:
Please let Leverify know you found this job on Remote First Jobs 🙏

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply