Backend/Data Engineer
PrettyData
*Please do not cold email us; doing so will disqualify you. Just submit an application here. No agencies, either.*
Role Overview
Want to get in on the ground floor of a startup? This is the opportunity for you!
You'll own our data pipeline end-to-end: web scrapers collecting industry data, ETL pipelines structuring it, and backend APIs making it accessible. This is the foundation enabling everything from pricing optimization to AI-powered business insights.
You'll work with our CTO advisor on the scraping system and our backend engineer on integration. Expect to ship code daily across scraping, ETL, infrastructure, and backend development.
What You'll Do
- Own and expand our web scraping system (Yelp, Google, booking platforms)
- Build data quality pipelines—clean, validate, transform scraped data
- Develop backend connectors/APIs for booking systems (Mindbody, Boulevard, Meevo)
- Deploy and manage services (Docker, VMs, monitoring) with Postgres, Celery, Redis
- Bonus: contribute to AI/ML features
Who You Are
You're a builder who ships without needing detailed specs or senior oversight. You figure out what needs doing, make pragmatic calls, and iterate fast. You thrive in unstructured startup environments and want to work across the full data stack—not specialize in one thing.
We value pragmatic problem-solving, shipping and iterating, direct communication, and learning through challenges. We're committed to building an inclusive team where people of all backgrounds thrive.
Requirements
- 3+ years Python backend development (we use FastAPI, but similar frameworks transfer)
- Web scraping experience (BeautifulSoup, Scrapy) with Celery or async task queues: you've handled dynamic content, rate limiting, and data extraction challenges
- ETL pipeline design using task orchestration (Celery, async workflows)—building pipelines for data transformation and quality validation
- Backend API development and third-party integrations (REST APIs, authentication, webhooks)
- SQL and relational databases (PostgreSQL experience preferred)
- Basic DevOps: Docker deployments, CI/CD concepts (e.g., Azure)