Data Engineer, ML Data Platform

Twelve Labs

Software Engineering, Data Science
San Francisco, CA, USA
Posted on Tuesday, March 22, 2022
Who we are
At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.
With $77 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and by prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang, and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.
We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.
About the role
As the Data Engineer at Twelve Labs, you will lead the development and optimization of infrastructure, pipelines, and databases crucial for processing and managing large-scale video datasets. As a key member of the team, you'll collaborate closely with ML researchers to ensure efficient access to high-quality datasets, enabling accelerated training of AI models for video understanding.

In this role, you will

  • Design the infrastructure needed for overall AI model training with video data
  • Build pipelines for collecting and processing large-scale datasets required for video understanding AI models
  • Create large-scale databases to efficiently manage video data and artifacts from preprocessing steps
  • Design and implement a version control and data cataloging system for owned data and labels
  • Work closely with the ML Research team to build pipelines for easy access and quick training on video datasets

You may be a good fit if you have

  • 3+ years of experience in software engineering (Server/Backend) or equivalent experience
  • 1+ years of experience with cloud platforms such as AWS, Azure, GCP or equivalent experience
  • 1+ years of experience handling large-scale data processing and building pipeline systems (e.g., Airflow)
  • Hands-on experience in building large-scale datasets for AI model training
  • Openness to learning and enthusiasm for new technologies
  • Ability to work in a team and deliver results collaboratively
  • Capability to create processes in the absence of clear structures
  • Interest in finding optimal solutions to problems by understanding the trade-offs between over- and under-engineering
  • Proficiency in Go
  • Experience in processing video data
Even if your prior experience doesn’t tick every box, we still encourage you to apply! If you are a 0-to-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at Twelve Labs.
We welcome applicants from all walks of life and are committed to equal-opportunity employment. We cherish and celebrate diversity not just because it is the right thing to do, but because it makes our company much stronger.
Benefits and Perks
🤝 An open and inclusive culture and work environment.
🧑‍💻 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
✈️ Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Year’s.
🏙 Remote-flexible, with offices in San Francisco and Seoul and a coworking stipend.