- Built a summary-first extraction pipeline (tender metadata in under 5 minutes), RAG-based semantic search over 300+ page documents, and a hybrid bid eligibility engine using policy thresholds, fuzzy region matching, and custom LLM rules for bid/no-bid decisions.
- Implemented a two-phase BOQ extraction pipeline (detect → per-chunk extract) with multi-BOQ evaluation framework; engineered a DOCX form-fill engine for automated bid document generation.
- Built a ReAct-style BidDraftingAgent with 4 specialized tools (pgvector tender search, vault RAG, past bids retrieval, company metadata) that autonomously drafts legally-compliant bid documents, cutting preparation time from days to under an hour.
- Owned end-to-end AWS deployment (EC2, RDS, S3) via Docker Compose; implemented Google OAuth + bcrypt/JWT auth, brute-force rate limiting, XSS sanitization, and idempotent SQL migrations for zero-downtime schema rollouts.
Rahul Singh
AI / ML Engineer
I build ML systems end-to-end — from training custom models to deploying them on AWS.
Currently the sole engineer at a startup where I built RAG pipelines, LLM agents, and a full production stack from scratch. Before that, I beat AWS Textract's accuracy at 40% less cost.
4x national hackathon winner, AIR 192 in the Amazon ML Challenge, and building a GPT from scratch in Go because why not.
Work Experience
- Dec2025 - PresentLastDraft AIFounding Engineer
- Sep2025 - Dec2025DocxsterMachine Learning Engineer
- Built a Workflow Automation Bot that interprets natural language requirements and auto-generates tailored workflows, reducing creation time from 30 minutes to 2 minutes and driving a 30%+ increase in workflow adoption.
- Built a cost-efficient document processing module integrating Visual Language Models (VLMs) with OCR, achieving Textract-level accuracy while reducing inference costs by 40% across varied document types.
- Mar2025 - Aug2025DocxsterMachine Learning Intern
- Engineered a synthetic data generation pipeline using a GAN to produce a custom dataset of 100,000+ document images, solving the critical lack of complex, real-world training data.
- Architected and developed DocStruct-YOLO, a YOLOv10-based model for Document Layout Analysis (DLA), precisely identifying tables, text, and figures as the foundational step for a proprietary extraction module.
- Jul2024 - Aug2024Sudha Gopalakrishnan Brain Centre, IIT MadrasMachine Learning Intern
- Developed a Python pipeline to convert SVG brain annotations into validated GeoJSON format, utilizing geospatial libraries to enable accurate mapping and interoperability with advanced analysis tools.
- Fine-tuned ResNet-50 for brain region similarity analysis, generating quantitative similarity scores that reduced manual review time by 10 hours/week and accelerated the research pipeline.
Recent projects
See all projectsLatest posts
See all postsLet's Connect
If you want to get in touch with me about something or just to say hi, reach out on social media or send me an email.