Lighthouz AI
Lighthouz AI

AI Engineer - Document Intelligence (2-6 Years Experience)

  • Full-time
  • 💰negotiable
  • 3 months ago
llm
agents
ml
openai
transformers
aws
ai
gcp
inference
nlp
rag
tensorflow
azure
pytorch
claude
prompting
**Location:** Hybrid - 2-3 days in office (Delhi / Gurgaon) **Company:** Lighthouz AI **Domain:** B2B SaaS ### **About Lighthouz AI** Lighthouz AI is automating the back office accounting of freight brokers with freight-native AI agents. Our AI agents process messy document paperwork in accounting in secondsnot hoursby replacing manual data verification, extraction, & audits and brittle RPA. Our platform thrives in real-world document chaosscanned and handwritten paperwork, and ambiguous emailsexecuting complex workflows automatically. The result: faster invoicing, quicker time-to-get-paid, fewer disputes, and 10x operational growth. We're a Y Combinator S24 company, founded by a team with deep experience across AI, supply chain, and enterprise systems (Google, Georgia Tech, Progressive). At Lighthouz, we're not just streamlining freight financewe're rebuilding it from the ground up. * * * ### **Role Overview** We're seeking an AI Engineer (Document Intelligence) to design, build, and maintain intelligent agents that will automate and transform document-based freight workflows. You'll work on the two core components of our AI agents first, the core perception systems that extract structured insights from messy, real-world freight documentshandwritten, scanned, distorted, or multi-page and second, our AI agents for email communications between freight parties. You will push the boundaries of creative prompt engineering to solve real-world problems at scale, fine-tuning LLMs, building large-scale document classification and entity extraction models, communication understanding, intent classification your code will be at the heart of automating financial decision-making in freight. You'll collaborate closely with the backend and product teams to bring AI models to life in production environments and continuously improve performance in the wild. **What You'll Do** * Create accurate and reliable systems to extract and analyze knowledge from documents * Architect, implement, and deploy AI agents for email and phone communications between freight accounting parties (payer/payee), leveraging language and vision LLMs for automation and analysis. * Design and refine high-impact prompts, templates, and evaluation harnesses to ensure robust, reliable agent behavior. * Build scalable pipelines for preprocessing, training, inference, and feedback loops, including evaluation and integration of VLMs. * Monitor and diagnose agent performance in production, rapidly addressing failures and refining prompts, workflows, and models. * Create and maintain high-quality training, evaluation, and test datasets. * Enhance the AI stack through creative prompting, fine-tuning, and continuous iteration. * Productionize models within Lighthouz's intelligent automation platform. * Collaborate with product and engineering teams to integrate AI outputs into document, email, and voice workflows, delivering polished, production-ready solutions. * Continuously improve model performance in real-world conditions. ### **What We're Looking For** * 2+ years in ML/AI roles, ideally in document AI * Deep learning expertise with PyTorch, TensorFlow * Prompting wizardry skilled at crafting precise, reliable prompts for VLMs & LLMs, translating complex tasks into actionable instructions * Experience fine-tuning VLMs * Strong knowledge of OpenAI, Claude, and other agentic toolkits * Hands-on with OCR, visual transformers, multimodal models * Background in conversational AI, voice AI, NLP research, and LLM training * Proven track record of training & deploying models to production * Problem-solver & builder mindset fast to prototype, faster to iterate * Comfortable with ambiguity and evolving datasets ### **Nice to Have** * Experience with AWS, Azure, or GCP-based ML infrastructure * Exposure to RAG pipelines, foundation models, or vector search systems * Knowledge of document layout understanding (e.g., Donut, LayoutLM, PubLayNet) * Background in building secure, production-grade ML services * Prior experience working in a startup ### **What We Offer** Competitive salary High ownership, zero bureaucracyhelp shape our AI stack from day one Work on impactful real-world problems that blend AI and automation at scale
Apply for this Job👉 Please reference you found the job on Remote Hits, this helps us get more companies to post here, thanks!

When applying for jobs, you should NEVER have to pay to apply. You should also NEVER have to pay to buy equipment which they then pay you back for later. Also never pay for trainings you have to do. Those are scams! NEVER PAY FOR ANYTHING! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. A good idea is to check the domain name for the site/email and see if it's the actual company's main domain name. Scams in remote work are rampant, be careful! Read more to avoid scams. When clicking on the button to apply above, you will leave Remote Hits and go to the job application page for that company outside this site. Remote Hits accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Looking for More Opportunities? 🔔

Get weekly email alerts with the latest remote jobs. Join 2M+ remote workers who never miss an opportunity.

📧 Get Weekly Remote Job Alerts

Get the best remote jobs delivered to your inbox weekly

🔒 We respect your privacy. Unsubscribe at any time.