Founding Data Engineer
Mattoboard
- Company Description *
Role
Data Architecture & Infrastructure: Design and build scalable data pipelines and storage solutions (ETL/ELT) to collect, clean, and normalize structured and unstructured data from web sources.
Web Scraping & Ingestion: Architect robust, fault-tolerant scraping systems using tools like Scrapy, Puppeteer, or custom crawlers; ensure compliance with robots.txt and rate-limiting best practices.
Search & Retrieval: Implement and optimize search stacks (e.g., Elasticsearch, Typesense) for full-text, faceted, and vector-based retrieval.
LLM Integration: Integrate large-language models (e.g., OpenAI GPT, Anthropic Claude) into search workflows—design prompt chains, fine-tune or embed models, and evaluate relevance metrics.
Candidate should be a software generalist with a good background in emerging AI techniques and general web based development.
We are a very small team and fully remote. We are open to both full time/part time positions, equity, etc.. Interested candidates please tell us what exceptional work you have done in any software field.