Machine Learning Engineer (2-4 years)

Docsumo

Software Engineering
Remote
Posted on Jul 27, 2024
About Us:
Docsumo is Document AI software that helps enterprises capture data from customer documents and analyze them. We convert documents such as invoices, ID cards, and bank statements into actionable data. We work with clients such as PayU, Arbor, and Hitachi, and are backed by Sequoia, Barclays, Techstars, and Better Capital.
We are looking for a Machine Learning Engineer to help us create and deploy machine learning solutions. As an ML Engineer, you will be part of a team deploying state-of-the-art machine learning models built by our Data Scientists. You will work directly with the CTO to develop end-to-end API products for the US market in the information extraction domain.

Responsibilities

  • Apply Data Science concepts to solve routine problems of target users.
  • You will design and build systems that help Docsumo process visual data, i.e. PDFs and images of documents.
  • Train and fine-tune machine learning models on large text datasets.
  • You'll work in our Machine Intelligence team, a close-knit group of scientists and engineers who incubate new capabilities from whiteboard sketches all the way to finished apps.
  • You will get to learn the ins and outs of building core capabilities & API products that can scale globally.
  • Should have hands-on experience applying advanced statistical learning techniques to different types of data.
  • Should be able to design, build, and work with RESTful web services using JSON and XML (Flask preferred).
  • Produce clean, efficient code based on specifications.
  • Should follow Agile principles and processes including (but not limited to) standup meetings, sprints and retrospectives.

Requirements

  • At least 2 years of experience in machine learning, text processing, data science, information retrieval, deep learning, natural language processing, text mining, regression, classification, etc.
  • Must have a full-time degree in Computer Science or similar (Statistics/Mathematics)
  • Experience working with OpenCV, TensorFlow, and Keras
  • Experience working with Python: NumPy, scikit-learn, Matplotlib, pandas
  • Familiarity with Version Control tools such as Git
  • Familiarity with ML frameworks and libraries
  • Theoretical and practical knowledge of SQL / NoSQL databases with hands-on experience in at least one database system.
  • Team player with the ability to work cooperatively with other engineers
  • Ability to make quick decisions in high-pressure environments with limited information.
  • Outstanding analytical and problem-solving skills
  • Must be self-motivated, flexible, collaborative, with an eagerness to learn