ML Researcher - Vision & VLMs (US)
Autonomize
Software Engineering, Data Science
Austin, TX, USA
Posted on Dec 22, 2025
About Autonomize AI
Autonomize AI is revolutionizing healthcare by streamlining knowledge workflows with AI. We reduce administrative burdens and elevate outcomes, empowering professionals to focus on what truly matters — improving lives. We're growing fast and looking for bold, driven teammates to join us.
The Opportunity
We’re seeking a world-class ML Researcher with deep hands-on experience in vision-language models (VLMs) like CLIP, Flamingo, GLaMM, or Gemini, and a strong understanding of the healthcare data ecosystem. You'll lead applied research efforts to enable agents that can understand, reason over, and ground decisions in images, scanned documents, medical forms, and charts. This role is at the intersection of research, product, and platform—your work will power real-time decisioning systems used by clinicians, case managers, and analysts in complex healthcare settings.
Key Responsibilities
- Research and build state-of-the-art VLM pipelines for interpreting:
  - Scanned PDFs, handwritten notes, claim attachments, EOBs, lab results
  - Radiology images, charts, or diagrams paired with text
- Fine-tune or adapt models (e.g., BLIP-2, MiniGPT-4, LLaVA, GLaMM) to healthcare tasks
- Develop novel multimodal grounding strategies for RAG and Agent workflows
- Evaluate and optimize models for factuality, safety, and explainability
- Collaborate with product and platform engineers to deploy models at scale
- Author technical reports, patents, or research papers based on innovation
Requirements
- PhD or Master’s in ML, CV, NLP, or related field from a top-tier institution (publications in CVPR, NeurIPS, EMNLP, etc. a plus)
- 5+ years of experience working on vision or VLMs in academic or industry settings
- Proven experience applying ML to healthcare documents or medical imaging
- Strong hands-on experience with PyTorch, HuggingFace, OpenCLIP, OpenVINO, or MMDet
- Comfortable with LLMs, prompt tuning, and retrieval-augmented generation (RAG) pipelines
- Deep understanding of HIPAA, PHI protection, and healthcare data handling
Bonus Points
- Familiarity with OCR systems (Tesseract, PaddleOCR, LayoutLM) and document intelligence
- Experience working with EHR systems, claims, or clinical ontologies (e.g., SNOMED, ICD-10)
- Contributions to open-source or academic research in VLMs or medical vision
- Knowledge of data-centric AI, weak supervision, or few-shot learning in clinical contexts
What We Offer
- A chance to make a real impact in the future of healthcare
- Autonomy, ownership, and the ability to chart your own growth path
- Competitive compensation and benefits
- 100% employer-paid health, vision, and dental insurance
- Retirement plans (401(k)), disability insurance, employee assistance programs
How to Apply
Send your resume and a brief cover letter explaining why you're the right person for this job to careers@autonomize.ai.