Senior Machine Learning Engineer

at Docsumo
  • Remote - Nepal

Remote

Software Development

Senior

Job description

About Docsumo:

Docsumo is your go-to Document AI solution for streamlining business operations. We turn complex documents like bank statements, policies, and financial statements into valuable, actionable data. Our cutting-edge technology helps businesses make smarter decisions faster. We are backed by marquee investors such as Sequoia, Barclays, Fifth Wall, Common Ocean, and Techstars.

At Docsumo, we’re on a mission to revolutionize how businesses handle data. We empower companies to:

Boost efficiency by 6-10 times

Make quick, accurate decisions from unstructured information

Scale operations effortlessly through innovative technology

The opportunity as Senior Machine Learning Engineer:

  • We are seeking senior professionals with over 4 years of experience in the field. This is a role for candidates with a proven track record in machine learning, deep learning, NLP, and computer vision.
  • If you have led and managed a team of ML scientists and engineers and have a strong foundation in deploying end-to-end ML solutions, this opportunity is for you.
  • This role offers the chance to rapidly advance into leadership positions, such as Lead ML Engineer, where you’ll spearhead critical projects with creative autonomy.
  • You’ll work closely with our CTO, Data Science, and Engineering teams, shaping the future of intelligent document processing for our expanding client base in the US.
  • This is a full-time role with flexible options, including hybrid work in Kathmandu or remote work from India.
  • Working hours are 10:00 am to 7:00 pm IST, with a 1-hour lunch break.
  • You will report directly to the Data Science Lead or the CTO, collaborating with a talented team to deliver innovative ML solutions that drive customer success.

Key Responsibilities

  • Collaborate with cross-functional teams of scientists and engineers to design, develop, and implement advanced machine learning systems that transform innovative ideas from conceptual stages into operational APIs.
  • Conduct cutting-edge research in machine learning, focusing on the application and fine-tuning of Large Language Models (LLMs) to develop robust, scalable solutions for intelligent document processing.
  • Lead a team of data scientists and machine learning engineers, providing mentorship and fostering a culture of collaboration and continuous learning to achieve high performance and innovative outcomes.
  • Plan, manage, and oversee the full lifecycle of ML projects, ensuring alignment with business goals and timely delivery of high-quality solutions. Develop and apply sophisticated machine learning algorithms to address complex business challenges, particularly those involving the processing and analysis of unstructured data.
  • Engage in Agile development processes including regular standups, sprint planning, and retrospectives to facilitate iterative progress and maintain high standards of output.
  • Ensure the documentation of machine learning methodologies, model development processes, and maintain rigorous testing standards to ensure reliability and efficiency of models in production.
  • Drive the integration and optimization of LLMs and other advanced models to enhance performance and operational efficiency, continuously seeking improvements.

Need to Have:

  • At least 4 years of industry experience in machine learning, deep learning, NLP, and computer vision, ideally within tech companies, product startups, or R&D environments.
  • 1-2 years of experience leading teams of 4-5+ ML scientists and engineers, demonstrating effective leadership and project management skills. Proficiency in PyTorch and TensorFlow, with experience in training deep neural networks, implementing transfer learning, and optimizing models.
  • Strong skills in classification and regression techniques, with hands-on experience in Scikit-learn, Numpy, Pandas, and Scipy. Practical experience with Transformers, such as BERT and GPT, for tasks like text classification, entity recognition, and sentiment analysis.
  • Proficiency in Python and strong understanding of Object-Oriented Programming (OOP) principles.
  • Experience with version control using Git, cloud platforms like AWS and Google Cloud, and containerization technologies such as Docker and Kubernetes.
  • Ability to work effectively in a team, demonstrating motivation, resourcefulness, and a growth mindset.
  • Bachelor’s degree in Computer Science, Statistics, Machine Learning, Physics, Mathematics, or a related field.

Nice to Have:

  • Experience with Large Language Models, including techniques like PEFT, prompt engineering, few-shot learning, and RLHF.
  • Knowledge of computer vision applications like OCR, OpenCV, CNNs, and multimodal AI.
  • Experience with advanced data visualization libraries.
  • Familiarity with Agile development practices and sprint management.
  • A track record of innovative research contributions or publications in the machine learning domain.
Share this job:
Please let Docsumo know you found this job on Remote First Jobs 🙏

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply