Principal AI Engineer
EINO
Software Engineering, Data Science
New York, NY, USA
About the role
This role owns the design and implementation of AI agents that power intelligent workflows across our platform, from reasoning and automation to evidence-driven RCA, customer-facing copilots, internal operations, and product intelligence.
You’re a strong Python engineer who understands modern agentic AI architecture and can turn ambiguous product needs into production-grade systems. You are comfortable building agents that use tools, retrieve context, reason over structured and unstructured data, execute workflows, and integrate deeply with backend services.
This is not a prompt-only role. We’re looking for someone who can architect, build, test, deploy, and operate AI-powered features end to end.
What you’ll build
Agentic AI systems
- Production-grade AI agents for different product and internal applications:
- site-level connectivity analysis
- evidence-first root cause analysis workflows
- customer-facing assistant and copilot experiences
- internal automation for operations, support, and engineering workflows
- reasoning workflows over telemetry, incidents, site data, and knowledge sources
- Modern agentic architecture:
- tool/function calling
- planning and execution loops
- state and memory management
- retrieval-augmented generation
- structured outputs
- workflow orchestration
- multi-agent patterns where appropriate
- guardrails, permissions, and safety constraints
- observability and evaluation frameworks
- AI-native product primitives:
- agent task models
- context assembly pipelines
- tool registries
- prompt/version management
- human-in-the-loop review flows
- agent execution traces
- eval datasets and regression testing
Python backend + product engineering
- Backend services and APIs in Python:
- FastAPI/Django/Flask-style services
- workers and async jobs
- integrations with internal systems and external APIs
- data models supporting AI workflows
- event-driven and workflow-driven architectures
- Retrieval and knowledge systems:
- vector search
- hybrid search
- document ingestion
- chunking and indexing strategies
- metadata filtering
- grounding and citation workflows
- Production AI infrastructure:
- LLM provider integration
- model routing
- cost and latency optimization
- Caching
- rate limits and retries
- monitoring and debugging
- failure handling and fallback behavior
- Feature delivery end to end:
- product scoping
- Architecture
- Implementation
- Testing
- Deployment
- Observability
- iteration based on user feedback
Responsibilities
- Architect and implement production-grade AI agents that solve real business and product problems.
- Build agent workflows that can reason over Eino’s data, tools, telemetry, site models, incidents, and knowledge sources.
- Own agent architecture patterns across planning, memory, retrieval, tool execution, structured outputs, evals, and observability.
- Build and operate core Python backend services that support AI-powered product features.
- Work closely with product, engineering, and leadership to identify high-value agentic AI use cases.
- Move quickly from prototype to production while maintaining reliability, security, and maintainability.
- Establish testing and evaluation discipline for AI systems:
- unit/integration tests
- prompt and workflow regression tests
- agent evals
- golden datasets
- trace review
- failure analysis
- Drive practical AI engineering standards:
- correctness over demos
- grounded outputs
- measurable quality
- clear contracts between agents, tools, and backend services
Required qualifications
- Strong experience building production Python systems, including services, APIs, workers, and backend infrastructure.
- Hands-on experience building AI agents, LLM-powered applications, RAG systems, workflow automation, or tool-using AI systems.
- Deep familiarity with modern agentic AI architectures, including tool/function calling, planning and execution loops, state and memory management, retrieval, structured outputs, guardrails, observability, and evaluation frameworks.
- Ability to take features from ambiguous requirements to production deployment.
- Strong software engineering fundamentals: system design, testing, debugging, performance, reliability, and maintainability.
- Experience integrating AI systems with real products, databases, APIs, and operational workflows.
- Strong product sense and ability to identify where AI agents create practical customer or business value.
- Startup mindset: high ownership, bias to shipping, comfort with ambiguity, and ability to operate with limited direction.
- Experience working in a fast-moving Seed, Series A, or similarly early-stage startup environment.
Preferred / nice-to-have
- Experience with agent frameworks such as LangGraph, LangChain, LlamaIndex, CrewAI, AutoGen, or similar.
- Experience with LLM APIs such as OpenAI, Anthropic, Google Gemini, or open-source model deployments.
- Experience with vector databases and search systems such as pgvector, Pinecone, Weaviate, Qdrant, Milvus, Elasticsearch, or OpenSearch.
- Experience building AI evaluation pipelines, agent test harnesses, prompt regression systems, or human-in-the-loop review workflows.
- Experience with backend infrastructure such as Postgres, Redis, queues/workers, event pipelines, object storage, and cloud services.
- Experience with AWS, GCP, or Azure deployment patterns for Python services.
- Experience building AI systems for B2B SaaS, enterprise software, infrastructure, telecom, networking, or data platforms.
- Familiarity with observability tools for AI systems, including tracing, latency monitoring, cost tracking, and quality evaluation.
- Experience with secure AI system design, including permissions, data access boundaries, audit logs, and safe tool execution.
Culture & operating principles
- Ownership is real: if an agent fails, behaves unpredictably, or creates user confusion, we debug it, fix it, and harden the system.
- Bias to shipping: meaningful progress weekly; fast prototypes; production mindset; tight feedback loops.
- Practical AI over hype: we build systems that work reliably, not demos that only look good once.
- High standards on reliability: correctness, grounding, evals, and observability matter.
- End-to-end thinking: agents, backend systems, product workflows, and user experience are one system.
- Low ego, high velocity: debate the idea, be direct and respectful, and keep moving.
What success looks like
- Multiple production AI agents are shipped and actively used across product and internal workflows.
- Agent workflows are reliable, observable, and measurable through evals and execution traces.
- AI features move from prototype to production quickly without becoming fragile one-off demos.
- Agents can safely use tools, retrieve context, produce structured outputs, and integrate with backend services.
- Evals and regression testing become part of the AI development lifecycle.
- Product and engineering teams can confidently identify, build, and expand agentic AI use cases.
About Eino
Eino is building the world’s first Connectivity Digital Twin — a deeply technical, AI-driven simulation platform that models real-world environments and their connectivity layers. We solve hard, physical-world problems using advanced AI, high-performance backend systems, geometric computation, and rich large-scale data.
We are a real AI company: not LLM-wrapper tooling, but deep tech. Our work spans geometry processing, spatial reasoning, GPU-accelerated simulation, data modeling, and agentic AI systems. Every engineer at Eino works on highly challenging problems and collaborates with a team of exceptional, experienced builders.
We are well-funded by strong investors, have real customers, and a clear roadmap to reshape how the world understands connectivity.
This is a rare opportunity for a hungry, entrepreneurial AI engineer to join a rocket-ship Seed/Series A startup at the ground level, help architect our agentic AI systems, and grow into the owner of critical AI product infrastructure.