Data Engineer, ML Data Platform
Twelve Labs
This job is no longer accepting applications
See open jobs at Twelve Labs.See open jobs similar to "Data Engineer, ML Data Platform" Techstars.In this role, you will
- Design the infrastructure needed for overall AI model training with video data
- Build pipelines for collecting and processing large-scale datasets required for video understanding AI models
- Create large-scale databases to efficiently manage video data and artifacts from preprocessing steps
- Design and implement a version control and data cataloging system for owned data and labels
- Work closely with the ML Research team to build pipelines for easy access and quick training on video datasets
You may be a good fit if you have
- 3+ years of experience in software engineering (Server/Backend) or equivalent experience
- 1+ years of experience with cloud platforms such as AWS, Azure, GCP or equivalent experience
- 1+ years in handling large-scale data processing and building pipeline systems (e.g., Airflow)
- Hands-on experience in building large-scale datasets for AI model training
- Open to learning and enthusiastic about new technologies
- Ability to work in a team and deliver results collaboratively
- Capability to create processes in the absence of clear structures
- Interest in finding optimal solutions to problems by understanding over/under engineering
- Proficiency in Go language
- Experience in processing video data
This job is no longer accepting applications
See open jobs at Twelve Labs.See open jobs similar to "Data Engineer, ML Data Platform" Techstars.