Data Scientist - Gen AI / QA

at Zyte
🇦🇷 Argentina - Remote
📊 Data 🔵 Mid-level

Job description

About Us

At Zyte, we eat data for breakfast, and you can eat your breakfast anywhere and work for Zyte. Founded in 2010, we are a globally distributed team of over 250 Zytans working from over 28 countries, on a mission to enable our customers to extract the data they need to continue to innovate and grow their businesses. We believe that all businesses deserve a smooth pathway to data.

For more than a decade, Zyte has led the way in building powerful, easy-to-use tools to collect, format, and deliver web data quickly, dependably, and at scale. The data we extract helps thousands of organizations make smarter business decisions, secure competitive advantage, and drive sustainable growth. Today, over 3,000 companies and 1 million developers rely on our tools and services to get the data they need from the web.

Data QA is an important function within Zyte. The Data QA team works to ensure that the quality and usability of the data scraped by our web scrapers meet and exceed the expectations of our enterprise clients.

Are you passionate about data, data quality, and data integrity?

Do you enjoy using Python and AI to analyze and manipulate data, detect data quality issues, and visualize your findings?

Are you highly customer-focused with excellent attention to detail?

Owing to a growing business and the need for ever more sophisticated Data QA, we are looking for a talented Data Scientist to join our team. As a Zyte Engineer, you will work on AI-based data wrangling, data manipulation, and data visualisation techniques and apply them to verifying and validating the quality of data extracted from the web.

Roles & Responsibilities:

  • Understand customer web scraping and data requirements; map these requirements to custom AI-based data quality validation techniques, with a focus on achieving pre-established degrees of data quality and uncovering data quality issues.
  • Draw conclusions about data quality by producing descriptive and evidence-based statistics, summaries, and visualisations.
  • Supplement existing manual QA and schema validation techniques with AI-based data quality verification.
  • Collaborate with developers to further troubleshoot and pinpoint solutions.
  • Present findings and conclusions to stakeholders at various levels (other members of the QA department, developers, project managers, account managers, customers).
  • Write high-quality, well-structured code that is maintainable and extensible.
  • Manage code using GitHub, Bitbucket, and other version control approaches as applicable.

Requirements:

  • Highly proficient in Python and the PyData stack, with a minimum of 3 years' experience (please provide code samples in your application, ideally pertaining to data analysis or Generative AI, via a link to GitHub or another publicly accessible service).
  • BS degree in Computer Science, Engineering, Mathematics, Statistics or equivalent.
  • Up to speed on the latest advances in Generative AI, particularly as they pertain to process automation, web scraping/parsing, and data quality verification.
  • Comfortable with Prompt Engineering and token/cost optimization.
  • Familiar with abstraction layers (MCP, Marvin, LangChain, etc.).
  • Experience coding against the APIs of at least one of the Google, OpenAI, or Anthropic models.
  • Experience visualising data quality and data quality issues.
  • Ability to work with very large datasets (into the millions of records).
  • Strong knowledge of software QA methodologies, tools, and processes.
  • Excellent level of written and spoken English; confident communicator; able to communicate on both technical and non-technical levels with various stakeholders on all matters of QA.
  • Outstanding attention to detail.

Desired Skills:

  • Prior experience in a Data QA role (where the focus was on verifying data quality, rather than testing application functionality).
  • Familiarity with Jupyter and JupyterLab.
  • Experience building your own dashboards.
  • Experience with Spark, BigQuery, and other big data technologies.
  • Previous remote working experience.

As a new Zytan, you will:

  • Become part of a self-motivated, progressive, multi-cultural team.
  • Have the freedom and flexibility to work from where you do your best work.
  • Attend conferences and meet with team members from across the globe.
  • Work with cutting-edge open source technologies and tools.
