AI/ML Intern

LakeFusion seeks an AI/ML Intern to support the design, testing, and evaluation of machine learning models and LLM-powered workflows that enhance entity resolution, data matching, and analytics across our Databricks-native MDM platform.

Duration: 3 months

Location: USA

Openings: 1

About the Role

We are looking for an AI/ML Intern to support the development of AI capabilities within the LakeFusion.ai platform. This role is intended for candidates with a strong foundation in Data Science, AI, or Computer Science; exposure to Large Language Models (LLMs) and Generative AI concepts is mandatory. You will work closely with senior engineers to apply AI and LLM techniques to real enterprise data management problems such as entity resolution, intelligent matching, and data enrichment.

Note: This role is open only to recent graduates on F-1 OPT status.

What You’ll Do

  • Assist in designing and testing LLM-based solutions for enterprise data management use cases
  • Apply prompt engineering techniques for structured and unstructured enterprise data
  • Support AI/ML model experimentation, evaluation, and documentation
  • Use SQL for data engineering and data analysis
  • Track and evaluate emerging trends in LLMs, Generative AI, and applied AI

What We’re Looking For

  • Bachelor’s or Master’s degree in Data Science, AI Engineering, Computer Science, or a related field
  • Exposure to LLMs and Generative AI concepts through real-world implementations or projects (mandatory)
  • Hands-on experience with prompt engineering
  • Strong SQL skills
  • Programming experience in Python

Nice-to-Have

  • Exposure to enterprise data management or MDM concepts
  • Familiarity with Databricks or lakehouse architectures
  • Experience evaluating LLM performance and prompt optimization
  • Understanding of entity resolution, data matching, or data quality workflows
  • Interest in applied AI solutions for real-world business problems

About the Company

LakeFusion.ai is an AI-powered Master Data Management (MDM) platform built for modern lakehouse environments. The platform helps enterprises unify data from multiple systems into a single, trusted source of truth using intelligent matching, entity resolution, and automated data quality rules. LakeFusion integrates with lakehouse architectures such as Databricks to deliver scalable, governed master data across business domains.
