Skip to main content

Databricks' Naveen Rao on Data as IP, Custom AI Models, and Future Infrastructure

[HPP] Naveen RaoJuly 8, 202523 min
32 connections·40 entities in this video→

Data as Intellectual Property

  • πŸ’‘ Data is seen as the ingredients to create valuable assets, moving beyond simple analytics to define intellectual property.
  • 🎯 The core thesis behind the MosaicML acquisition by Databricks was to unlock data value through customized AI models.
  • πŸ”‘ The manifestation of how data is used is evolving, including contextualizing responses, fine-tuning, and building better evaluation criteria.

The Importance of AI Model Evaluation

  • βœ… An evaluation-first mindset is crucial for defining what success means for AI tasks and goals.
  • πŸ€– Automating the evaluation process is key to assessing and customizing models effectively.
  • πŸ”¬ This involves checking hypothesized models against real observational data to validate their accuracy.

Dynamic Dashboards and Workflow Automation

  • ✨ Databricks has embedded "genie" dashboards that provide dynamic, on-demand insights from data.
  • πŸ“Š These tools can generate SQL code on the fly to process and plot data as requested by users.
  • πŸš€ The ultimate goal is to create self-learning, reinforcing loops that continuously improve AI systems.

Advancements in AI Infrastructure

  • ⚑ Chip-to-chip bandwidth and memory bandwidth are identified as the primary drivers for future AI performance.
  • 🧠 These hardware improvements enable larger context windows and are essential for reducing hallucinations in models.
  • ⚠️ Current physics are reaching limits, making traditional scaling difficult and raising concerns about power consumption.

Future AI Directions and Enterprise Strategy

  • 🌱 Future investments are focused on defining true intelligence and enabling model-to-model collaboration.
  • πŸ› οΈ AI is expected to revolutionize engineering design processes, fostering a Cambrian explosion of creativity for developers.
  • πŸ“ˆ Enterprises should prioritize evaluation first, design workflow-aware user interfaces, and establish dedicated AI and automation groups.
Knowledge graph40 entities Β· 32 connections

How they connect

An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.

Hover Β· drag to explore
40 entities
Chapters10 moments

Key Moments

Transcript85 segments

Full Transcript

Topics15 themes

What’s Discussed

AIDatabricksGenerative AIData as Intellectual PropertyCustom Model TrainingAI Model EvaluationGenie DashboardsSemiconductor TechnologyMemory BandwidthContext WindowsAI HallucinationsModel-to-model CollaborationWorkflow-aware User InterfacesVector DatabasesRetrieval Augmented Generation (RAG)
Smart Objects40 Β· 32 links
PeopleΒ· 2
CompaniesΒ· 11
ConceptsΒ· 20
ProductsΒ· 6
MediaΒ· 1