Principal Site Reliability Engineer

💰 $166k-$293k
🇺🇸 United States - Remote
🔧 DevOps
🟡 Principal

Job description

We’re Blue River, a team of innovators driven to create intelligent machinery that solves monumental problems for our customers. We empower our customers – farmers, construction crews, and foresters – to implement safer and more sustainable solutions, driving increased profitability with less reliance on scarce labor. We believe that focusing on the small stuff – pixel-by-pixel and task-by-task – leads to big gains.

Blue River Technology (BRT) aligns with John Deere’s vision to “innovate on behalf of humanity” by quickly identifying and solving high-value, high-uncertainty challenges in AI, machine learning, computer vision, and robotics. BRT acts as a research and development flywheel, building not only new products but also new platforms that reliably create value for both Deere and its customers. From fully autonomous machines to highly precise farming equipment, BRT and Deere are partnering to create technical breakthroughs in industries like agriculture and construction.

Summary

We are looking for a Principal Site Reliability Engineer to join the CVML Platform team at Blue River Technology. You will work to create a hybrid infrastructure, integrating edge devices, on-premises, and cloud resources into a cohesive CVML & Robotics foundation. You will work on the cost effectiveness, transparency, and security aspects of the platform, focusing on the speed and quality of the solutions and services provided. You will work with both your peers and stakeholders from other teams to achieve alignment on the platform’s vision and technologies. You must show initiative and the ability to organize your work schedule, and be comfortable with supporting the application needs of multiple teams, systems, and products.

  • Employment Type: Full-Time
  • Work Location: Remote in the United States
  • Visa sponsorship is available for this position on a case-by-case basis.

Job Responsibilities

A combination, not necessarily all-inclusive, of the following:

  • System Design: Architect and implement various cloud and on-premises applications, systems, and infrastructure.
  • Hybrid system integration: Integrate highly diverse systems and configure them for stable integration, uptime, and monitoring.
  • Edge device integration: Work with edge devices of various formats and integrate them with on-prem and cloud workflows, including networking, low-level OS, and electrical/control integration.
  • Low-level performance optimization: Optimize the performance and throughput of the system at the filesystem, networking, and software levels.
  • High-level optimization of cost and stability: Optimize the cost, operational stability, and supportability of highly diverse platforms and tech stacks.
  • Product Mindset: Collaborate with cross-functional teams to design, develop, and maintain robust, scalable, and user-friendly web and mobile data-intensive applications.
  • System Integration: Build tools that enable users to easily move between different applications and platforms to utilize the strengths of each in a coherent ecosystem.
  • Collaboration: Work closely with cross-functional teams, including data scientists, analysts, software engineers, and product managers, to understand data requirements and deliver data solutions that align with business goals.
  • Documentation: Create and maintain technical documentation, including data flow diagrams, architecture designs, and standard operating procedures.
  • Technology Evaluation: Stay up-to-date with industry trends and emerging technologies related to data engineering, recommending and implementing new tools and frameworks as appropriate.

Required Experience and Skills

  • 8+ years of experience building infrastructure with K8S, AWS, and bare metal.
  • 8+ years of experience working with Python and Go (with production experience).
  • 8+ years of experience working with infra automation tools: Terraform / Terragrunt (or Pulumi / CDK).
  • 8+ years of experience with Linux-based systems and networks, and a deep understanding of internal components, networking, and security aspects.
  • A track record of building and maintaining scalable systems in production environments.
  • Experience in building CI/CD pipelines using GitHub Actions (or GitLab / Jenkins) for application release and deployment.
  • Experience in using AWS ECS, EKS, IAM, EC2, and RDS at production scale.
  • Deep understanding of Kubernetes and its internals (kubelet, CRDs, etc.) and experience with building and extending clusters from scratch.
  • Strong problem-solving skills and ability to troubleshoot complex infrastructure and networking issues.
  • Excellent communication skills to collaborate effectively with technical and non-technical stakeholders.
  • Attention to detail and commitment to producing high-quality, well-documented code.

Preferred Experience and Skills

  • Experience with standard SQL, NoSQL, and MPP databases.
  • Experience with writing production Kubernetes operators.
  • Airflow, Kubeflow, or other orchestration system experience.
  • Can understand some C++ and/or Rust, or talk with people who do.
  • Prior experience in the autonomy and robotics space is a huge plus.

Only individual applicants will be considered. We do not work with unsolicited third-party agencies or proxy interview services.

At Blue River, we’re passionate about creating an inclusive workplace that promotes and values diversity. While we have more work to do to advance diversity and inclusion, we’re investing in our programs, including recruiting, mentorship, career development, and learning & development to ensure they support our Diversity, Equity, and Inclusion goals. We support each employee in living a full life, enabling a thriving career, and accomplishing a meaningful, challenging mission while collaborating with incredible people. We are dedicated to building a diverse and inclusive workplace, so if you’re excited about this role but your experience doesn’t align completely with the job description, we encourage you to apply anyway.

We are an equal-opportunity employer and do not discriminate based on race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. Please contact us to request an accommodation.

The US annual base salary range for this position is $166,000 - $293,000, along with eligibility for Blue River’s bonus and benefit programs.

Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your location during the hiring process. During the recruitment process, we may identify an alternative role or level to which you are more suited. If your ideal role at Blue River differs from the advertised position, we will provide an updated pay range as soon as possible during the hiring process.

