Senior Data Engineer - AWS & RAG Pipelines

Job Description

We’re looking for a Senior Data Engineer to design and operate the cloud data infrastructure powering our AI initiatives. You’ll architect production-scale data lakes on AWS, build real-time ingestion and observability pipelines, and own the vector search and embedding layers that feed our RAG systems and autonomous agents.

Must-Have

  • Overall Experience: 7+ years in Data Engineering, Distributed Systems, or Data Architecture
  • AWS & Infrastructure: 4+ years architecting production-scale data lakes, storage tiers, and event streaming
  • AI/LLM Pipelines: 2+ years building RAG systems, managing embeddings, and orchestrating foundation models
  • Proficiency in AWS Data Lake Architecture & Storage
  • Proficiency in Real-Time Observability & Log Analytics
  • Proficiency in Elasticsearch & OpenSearch Optimization, Vectorization, Embeddings
  • Proficiency in Amazon Bedrock & Generative AI Pipelines
  • Proficiency in Software Engineering & API Ingestion
  • Production-level proficiency in one or more of: C# (.NET Core), Java, Python, or Node.js

Preferred Experience

  • AWS S3 partitioning strategies, lifecycle policies, and columnar formats (Parquet, Iceberg)

  • AWS Glue Data Catalog and Lake Formation for multi-tenant, fine-grained access control

  • Query optimization over petabyte-scale datasets using Amazon Athena and Redshift Spectrum

  • Distributed OpenTelemetry (OTel) Collector configuration for capturing logs, traces, and metrics and routing them into S3

  • High-volume streaming of system logs, Datadog captures, and raw server events into S3

  • Real-time CDC from PostgreSQL using Debezium or AWS DMS

  • Amazon OpenSearch clusters with simultaneous lexical and high-dimensional vector search

  • OpenSearch index lifecycle management, sharding strategies, and dynamic mappings at scale

  • Amazon Bedrock foundation model APIs (Claude, Titan) for data enrichment, classification, and semantic parsing

  • Knowledge Bases for Amazon Bedrock for automatic chunking, metadata extraction, and vector index syncs from S3

  • ETL/ELT pipelines ingesting unstructured event data from SaaS APIs (e.g., Pendo, Hotjar, Google Analytics)

  • MCP server development to expose data lake context and utilities to AI agents

Benefits

  • Remote work.

  • 13 floating holidays.

  • 15 vacation days per completed year.

  • Good working environment.

Every qualified candidate who meets the requirements outlined in the job description will be considered in this hiring process without distinction.

Furthermore, Jalasoft is an equal opportunity employer. We wholeheartedly embrace our responsibility to make employment decisions without regard to race, age, marital or social status, national origin, disability, sex, gender identity or expression, or any other characteristic unrelated to a candidate's or employee's qualifications and suitability for the position. Our management is committed to upholding this policy.
