Invented a novel UI-tree capture pipeline that generated 715 reliable chatbot actions in one month — where months of manual effort had produced only 3.
The RAG chatbot (see Milo AI project) needed structured action data describing every workflow in a feature-rich HR & logistics web portal. Documentation was sparse and outdated, leaving the chatbot with too few reliable answers. The team had spent months brainstorming approaches and only managed to hand-craft 3 actions.
I conceived and solely built a two-stage pipeline that captures live UI interactions and transforms them into localized action files for RAG ingestion:
| Component | Purpose | Key Tech |
|---|---|---|
| Web-UI-Tree-Builder | Browser bookmarklet records UI clicks and builds a hierarchical action tree | JavaScript · Node.js · FastAPI |
| Tree Host Server | Persists captured trees as JSONL, serves a live tree viewer | FastAPI · Docker |
| Action Generation | Extracts paths, generates AI summaries, adds descriptions & keywords | Python · Azure OpenAI |
| Localization | Translates actions and summaries into French & Italian | Azure OpenAI · FastAPI endpoints |
Conceived the entire UI-tree capture approach with no prior art found online; developed the bookmarklet, tree host server and action generation pipeline from scratch; manually clicked through the full web app UI within one week to build the action tree; championed the idea against skeptical project leads who doubted feasibility.
| Challenge | Mitigation & Result |
|---|---|
| No existing tools or approaches for UI-action extraction | Researched extensively, found no prior art → designed and built custom toolchain from scratch |
| Skeptical project leadership | Delivered working prototype quickly → results spoke for themselves |
| Content-Security-Policy blocking bookmarklet injection | Browser extension to disable CSP during capture sessions |
| Multilingual requirement (DE/FR/IT) | Automated LLM-based translation pipeline → full trilingual action set |
MILO5 Web UI → Bookmarklet (Alt+Click) → Tree Host Server → JSONL
↓
JSONL → Path Extraction → AI Summaries → Localization → Action JSON
↓
Final Action Files → Milo AI RAG Ingestion → Chatbot Answers