Trevor Darrell

University of California, Berkeley

Top CV ResearchersFrontier Research MapScore: 9h-index: 156225,499 citations

Homepage Berkeley homepage UCLA CS 201 talk page Berkeley research profile Semantic Scholar

Top CV Researcher — Rank #3 (top 10)

Professor; founding co-director of BAIR

Contributions

Caffe, visual recognition, multimodal learning, embodied perception

Why Selected

Long-running leader in visual recognition, multimodal learning, and large-scale vision systems, with strong academic and translational impact.

Score Breakdown

historical impact

recent visibility

current influence

asset availability

total

Frontier Research Map

Featured Work

The Surprising Efficacy of "Ungrounded" Models for Image and Video Understanding, and Generation

official seminar page — 2024-03-12

Why Now

This is a useful frontier tension: language priors are often shockingly effective even when they are not deeply grounded in vision.

Key Ideas

-Language models can contribute nontrivial structure to image and video reasoning even without direct physical grounding.
-Visual systems increasingly look like orchestration layers over heterogeneous pretrained modules.
-Useful multimodal intelligence may emerge from composition before full grounding is solved.

Open Questions

?Where is the boundary between productive language prior and confident hallucination?
?Should we design multimodal systems as monoliths or as compositional toolchains?
?What tasks punish ungrounded shortcuts strongly enough to force genuine perception?

Canonical CV Leadershigh confidence

← Back to People