Anel Nurkayeva

About Me

Data Governance, ML/AI

I am a Data Manager

Accomplished Data Curator with a strong track record in managing and optimizing large-scale datasets to advance autonomous driving technologies. With expertise in designing robust data collection strategies, implementing rigorous quality controls, and leveraging Python, Pandas, NumPy, SQL, and Git, Anel has consistently driven impactful insights and operational efficiencies. Her career spans pivotal roles at leading tech firms, where she has excelled in fostering collaborative environments and spearheading transformative projects that align with strategic business objectives. Anel’s commitment to excellence and her proficiency in machine learning underscore her ability to drive innovation in dynamic, data-intensive projects from labeling to deployment.

Worked with

Procter & Gamble
Google
Meta

Resume

6 Years of Experience

Volunteering

2020

Lead Data Science Manager

Omdena

  • Collected data and curated datasets for quarterly challenges
  • Led data labeling and modeling efforts; created a deep learning algorithm
  • Mentored a geographically distributed team of junior ML engineers through data preparation, modeling, labeling, and reporting
  • Created ML models of outstanding accuracy

Experience

July 2022 - March 2024

Data Asset Manager

Procter & Gamble

  • Led the development of large-scale datasets aligned with neural network requirements, unlocking $200M in energy savings
  • Identified improvement opportunities, integrated stakeholder feedback, and collaborated on process enhancements for data publication, creating a new metadata load process that cut the turnaround from 1 month to 2 days
  • Designed and implemented data collection strategies leveraging engineering resources and customer data to gather thousands of examples
  • Streamlined complex datasets into pipelines and dashboards, influencing leaders to make data-driven decisions
  • Curated the data catalog: articles, dataset metadata, and terms and conditions
  • Led certification trainings, organizing training sessions (Data Integrity and Operability) and cultivating an inclusive work environment that fosters authenticity and encourages all voices to be heard



Sep 2021 - June 2022

Data Manager

QAnalyst@Facebook VR

  • Managed requirements collection and data modeling, and developed data pipelines and dashboards for inventory, stock, and product feedback
  • Coordinated with AI Engineers and Test teams to develop and validate evaluations, facilitating continuous improvement
  • Implemented rigorous auditing and filtering protocols to maintain dataset integrity and enhance AI training efficacy.
  • Drove cohesive vision alignment across teams by providing customer-centric input on priorities and communicating status updates to ensure clarity on objectives and delivery.
  • Troubleshot issues and bugs, identifying upstream issues to prevent data breakages
  • Leveraged data expertise to drive data analysis projects
  • Became the go-to expert in data extraction, transformation, and visualization, enabling our teams to derive meaningful insights that drive business success
  • Produced detailed reports highlighting key issues, release blockers, project progress, and actionable next steps for various projects.



Jan 2020 - June 2022

Ops Analytics Manager

SMX@Google Shopping

  • Created an SOP for labeling and flagging transactions, decreasing fraud by 0.04% (12M a week)
  • Designed and maintained risk detection and monitoring dashboards and alerts
  • Became an expert in core risk infrastructure, tools, systems, and data
  • Collaborated with a diverse array of stakeholders on both the business and technical sides, becoming the team's data expert
  • Created an image classifier to distinguish between industrial and residential addresses



Apr 2018 - June 2020

Lead Content Labeling Analyst - Data Integrity Team

ProUnlimited@Facebook

  • Elevated quality metrics from 50% to 90% by implementing data-driven training for QA teams 
  • Created and managed daily dashboards tracking volume and labeling quality using Tableau, SQL, and Python 
  • Trained new labelers and offshore teams
  • Wrote labeling guidelines, further improving quality and the process
  • Supported the establishment of data quality rules with the business; monitored data quality, coordinated the remediation of data quality issues and data exceptions, and reported related metrics

Apr 2017 - Apr 2018

Labeler / Annotator

YouTube

  • Wrote a script to speed up the submission of reviewed data, making submission 5x faster
  • Supported urgent requests and projects while collaborating with cross-functional teams
  • Suggested policy changes



Coding

Python

Pandas

Git

SQL

NumPy

Data Management

Data Labeling

Guidelines, SOP, Labeling Rules

Data Visualization

ML/AI data prep

Certifications

2024

Certified Data Asset Manager

CDMP

2023

AI Solution Architect

Microsoft

2024

PSPO I

Agile/Scrum

2022

AI Product Management

Udacity

Education

2022

Master's Applied Data Science

University of Michigan

2010

MBA

SF Bay University

2017

Bachelor's in Computer Science

SF Bay University

Awards

2004

Kazakhstan Governmental Educational Grant

Kazakhstan Government

2020

Bertelsmann Scholarship

Udacity

2020

Kahuna Award for outstanding leadership (Udacity)

Udacity

Assigned Celebration channel Leader

Creator of AI challenges for students (data curation and organization)

Contact

Get in Touch

San Jose, CA, USA
anel@umich.edu
+14089159868