-
Notifications
You must be signed in to change notification settings - Fork 12
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Jay-Ye
committed
Nov 19, 2023
1 parent
7b0e7b9
commit 766fc58
Showing
18 changed files
with
3,341 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,259 @@ | ||
<!DOCTYPE html> | ||
<html> | ||
<head> | ||
<meta charset="utf-8"> | ||
<meta name="description" | ||
content="LLMs as an Agent for Autonomous Driving"> | ||
<meta name="keywords" content="LLM, Agent, Autonomous Driving"> | ||
<meta name="viewport" content="width=device-width, initial-scale=1"> | ||
<title>Agent-Driver</title> | ||
|
||
<!-- Global site tag (gtag.js) - Google Analytics --> | ||
<script async src="https://www.googletagmanager.com/gtag/js?id=G-DVW0FRCNQ8"></script> | ||
<script> | ||
window.dataLayer = window.dataLayer || []; | ||
|
||
function gtag() { | ||
dataLayer.push(arguments); | ||
} | ||
|
||
gtag('js', new Date()); | ||
|
||
gtag('config', 'G-PYVRSFMDRL'); | ||
</script> | ||
|
||
<link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro" | ||
rel="stylesheet"> | ||
|
||
<link rel="stylesheet" href="./static/css/bulma.min.css"> | ||
<link rel="stylesheet" href="./static/css/bulma-carousel.min.css"> | ||
<link rel="stylesheet" href="./static/css/bulma-slider.min.css"> | ||
<link rel="stylesheet" href="./static/css/fontawesome.all.min.css"> | ||
<link rel="stylesheet" | ||
href="https://cdn.jsdelivr.net/gh/jpswalsh/academicons@1/css/academicons.min.css"> | ||
<link rel="stylesheet" href="./static/css/index.css"> | ||
<link rel="icon" href="./static/images/gpt.png"> | ||
|
||
<script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script> | ||
<script defer src="./static/js/fontawesome.all.min.js"></script> | ||
<script src="./static/js/bulma-carousel.min.js"></script> | ||
<script src="./static/js/bulma-slider.min.js"></script> | ||
<script src="./static/js/index.js"></script> | ||
</head> | ||
<body> | ||
|
||
<nav class="navbar" role="navigation" aria-label="main navigation"> | ||
<div class="navbar-brand"> | ||
<a role="button" class="navbar-burger" aria-label="menu" aria-expanded="false"> | ||
<span aria-hidden="true"></span> | ||
<span aria-hidden="true"></span> | ||
<span aria-hidden="true"></span> | ||
</a> | ||
</div> | ||
<div class="navbar-menu"> | ||
<div class="navbar-start" style="flex-grow: 1; justify-content: center;"> | ||
<a class="navbar-item" href="https://PointsCoder.github.io"> | ||
<span class="icon"> | ||
<i class="fas fa-home"></i> | ||
</span> | ||
</a> | ||
</div> | ||
|
||
</div> | ||
</nav> | ||
|
||
<section class="hero"> | ||
<div class="hero-body"> | ||
<div class="container is-max-desktop"> | ||
<div class="columns is-centered"> | ||
<div class="column has-text-centered"> | ||
<h1 class="title is-1 publication-title">A Language Agent for Autonomous Driving</h1> | ||
<div class="is-size-5 publication-authors"> | ||
<span class="author-block"> | ||
<a href="https://PointsCoder.github.io">Jiageng Mao</a><sup>1*</sup>,</span> | ||
<span class="author-block"> | ||
<a href="https://jay-ye.github.io/">Junjie Ye</a><sup>1*</sup>,</span> | ||
<span class="author-block"> | ||
<a href="https://www.linkedin.com/in/yuxiqian">Yuxi Qian</a><sup>1</sup>,</span> | ||
<span class="author-block"> | ||
<a href="https://web.stanford.edu/~pavone/index.html">Marco Pavone</a><sup>2,3</sup>, | ||
</span> | ||
<span class="author-block"> | ||
<a href="https://yuewang.xyz">Yue Wang</a><sup>1,3</sup> | ||
</span> | ||
</div> | ||
|
||
<div class="is-size-5 publication-authors"> | ||
<span class="author-block"><sup>1</sup>University of Southern California,</span> | ||
<span class="author-block"><sup>2</sup>Stanford University,</span> | ||
<span class="author-block"><sup>3</sup>NVIDIA</span> | ||
</div> | ||
|
||
<div class="is-size-5 publication-authors"> | ||
<span class="author-block">* indicates equal contribution.</span> | ||
</div> | ||
|
||
<div class="column has-text-centered"> | ||
<div class="publication-links"> | ||
<!-- PDF Link. --> | ||
<span class="link-block"> | ||
<a href="" | ||
class="external-link button is-normal is-rounded is-dark"> | ||
<span class="icon"> | ||
<i class="fas fa-file-pdf"></i> | ||
</span> | ||
<span>Paper</span> | ||
</a> | ||
</span> | ||
<span class="link-block"> | ||
<a href="" | ||
class="external-link button is-normal is-rounded is-dark"> | ||
<span class="icon"> | ||
<i class="ai ai-arxiv"></i> | ||
</span> | ||
<span>arXiv</span> | ||
</a> | ||
</span> | ||
<!-- Video Link. --> | ||
<!-- <span class="link-block"> | ||
<a href="https://www.youtube.com/watch?v=MrKrnHhk8IA" | ||
class="external-link button is-normal is-rounded is-dark"> | ||
<span class="icon"> | ||
<i class="fab fa-youtube"></i> | ||
</span> | ||
<span>Video</span> | ||
</a> | ||
</span> --> | ||
<!-- Code Link. --> | ||
<span class="link-block"> | ||
<a href="" | ||
class="external-link button is-normal is-rounded is-dark"> | ||
<span class="icon"> | ||
<i class="fab fa-github"></i> | ||
</span> | ||
<span>Code (coming soon)</span> | ||
</a> | ||
</span> | ||
<!-- Dataset Link. --> | ||
<!-- <span class="link-block"> | ||
<a href="https://github.com/google/nerfies/releases/tag/0.1" | ||
class="external-link button is-normal is-rounded is-dark"> | ||
<span class="icon"> | ||
<i class="far fa-images"></i> | ||
</span> | ||
<span>Data</span> | ||
</a> --> | ||
</div> | ||
|
||
</div> | ||
</div> | ||
</div> | ||
</div> | ||
</div> | ||
</section> | ||
|
||
<section class="hero teaser"> | ||
<div class="container is-max-desktop"> | ||
<div class="hero-body"> | ||
<img src="static/images/teaser_wide.png" alt="Agent-Driver"> | ||
<h2 class="subtitle has-text-centered"> | ||
<span class="dnerf">Agent-Driver</span> transforms the conventional perception-prediction-planning framework by introducing LLMs as an agent for autonomous driving. | ||
</h2> | ||
</div> | ||
</div> | ||
</section> | ||
|
||
<section class="section"> | ||
<div class="container is-max-desktop"> | ||
<!-- Abstract. --> | ||
<div class="columns is-centered has-text-centered"> | ||
<div class="column is-four-fifths"> | ||
<h2 class="title is-3">Abstract</h2> | ||
<div class="content has-text-justified"> | ||
<p> | ||
Human-level driving is an essential goal of autonomous driving. Conventional approaches formulate autonomous driving as a perception-prediction-planning framework, yet their systems do not capitalize on the inherent reasoning ability and experiential knowledge of humans. In this paper, we propose a fundamental paradigm shift from current pipelines, exploiting Large Language Models (LLMs) as a cognitive agent to integrate human-like intelligence into autonomous driving systems. Our system, termed Agent-Driver, transforms the traditional autonomous driving pipeline by introducing a versatile tool library accessible via function calls, a cognitive memory of common sense and experiential knowledge for decision-making, and a reasoning engine capable of chain-of-thought reasoning, task planning, motion planning, and self-reflection. Powered by LLMs, our Agent-Driver is endowed with intuitive common sense and robust reasoning capabilities, thus enabling a more nuanced, human-like approach to autonomous driving. We evaluate our system on the large-scale nuScenes benchmark, and extensive experiments substantiate that our Agent-Driver significantly outperforms the state-of-the-art driving methods by a large margin. Our approach also demonstrates superior interpretability and few-shot learning ability to these methods. | ||
</p> | ||
</div> | ||
</div> | ||
</div> | ||
</div> | ||
</section> | ||
|
||
<section class="section"> | ||
<div class="container is-max-desktop"> | ||
<!-- Method. --> | ||
<div class="columns is-centered has-text-centered"> | ||
<div class="column is-full_width"> | ||
<hr> | ||
<h2 class="title is-3">Method</h2> | ||
<br> | ||
<img src="static/images/method.png" width="900" class="center"> | ||
<div class="content has-text-justified"> | ||
<br> | ||
<p> | ||
</p><li> | ||
We present Agent-Driver, an LLM-powered agent that revolutionizes the traditional perception-prediction-planning framework, establishing a powerful yet flexible paradigm for human-like autonomous driving. | ||
</li> | ||
<li> | ||
Agent-Driver integrates a tool library for dynamic perception and prediction, a cognitive memory for human knowledge, and a reasoning engine that emulates human decision-making, all orchestrated by LLMs to enable a more anthropomorphic autonomous driving process. | ||
</li> | ||
<li> | ||
Agent-Driver significantly outperforms the state-of-the-art autonomous driving systems by a large margin, with over $30\%$ collision improvements in motion planning. Our approach also demonstrates strong few-shot learning ability and interpretability on the nuScenes benchmark. | ||
</li> | ||
<li> | ||
We provide a variety range of ablation study to dissect the proposed architecture and understand the efficacy of each module, to facilitate future research in this direction. | ||
</li> | ||
<p></p> | ||
</div> | ||
</div> | ||
</div> | ||
</div> | ||
</section> | ||
|
||
<section class="section"> | ||
<div class="container is-max-desktop"> | ||
<!-- Method. --> | ||
<div class="columns is-centered has-text-centered"> | ||
<div class="column is-full_width"> | ||
<hr> | ||
<h2 class="title is-3">Demos</h2> | ||
<br> | ||
<img src="static/images/viz.png" width="1200" class="center"> | ||
<div class="content has-text-justified"> | ||
|
||
</div> | ||
</div> | ||
</div> | ||
<hr> | ||
</div> | ||
</section> | ||
|
||
<section class="section" id="BibTeX"> | ||
<div class="container is-max-desktop content"> | ||
<h2 class="title">BibTeX</h2> | ||
<pre><code>@article{mao2023agentdriver, | ||
author = {Mao, Jiageng and Ye, Junjie and Qian, Yuxi and Pavone, Marco and Wang, Yue}, | ||
title = {A Language Agent for Autonomous Driving}, | ||
year = {2023}, | ||
}</code></pre> | ||
</div> | ||
</section> | ||
|
||
|
||
<footer class="footer"> | ||
<div class="container"> | ||
<div class="columns is-centered"> | ||
<div class="column is-8"> | ||
<div class="content"> | ||
<p> | ||
Website template is borrowed from <a | ||
href="https://github.com/nerfies/nerfies.github.io">nerfies</a>. | ||
</p> | ||
</div> | ||
</div> | ||
</div> | ||
</div> | ||
</footer> | ||
|
||
</body> | ||
</html> |
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
Oops, something went wrong.
Oops, something went wrong.