Skip to content

Commit

Permalink
init
Browse files Browse the repository at this point in the history
  • Loading branch information
Jay-Ye committed Nov 19, 2023
1 parent 7b0e7b9 commit 766fc58
Show file tree
Hide file tree
Showing 18 changed files with 3,341 additions and 0 deletions.
258 changes: 258 additions & 0 deletions index.html
Original file line number Diff line number Diff line change
@@ -1 +1,259 @@
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<meta name="description"
content="LLMs as an Agent for Autonomous Driving">
<meta name="keywords" content="LLM, Agent, Autonomous Driving">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>Agent-Driver</title>

<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=G-DVW0FRCNQ8"></script>
<script>
window.dataLayer = window.dataLayer || [];

function gtag() {
dataLayer.push(arguments);
}

gtag('js', new Date());

gtag('config', 'G-PYVRSFMDRL');
</script>

<link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro"
rel="stylesheet">

<link rel="stylesheet" href="./static/css/bulma.min.css">
<link rel="stylesheet" href="./static/css/bulma-carousel.min.css">
<link rel="stylesheet" href="./static/css/bulma-slider.min.css">
<link rel="stylesheet" href="./static/css/fontawesome.all.min.css">
<link rel="stylesheet"
href="https://cdn.jsdelivr.net/gh/jpswalsh/academicons@1/css/academicons.min.css">
<link rel="stylesheet" href="./static/css/index.css">
<link rel="icon" href="./static/images/gpt.png">

<script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script>
<script defer src="./static/js/fontawesome.all.min.js"></script>
<script src="./static/js/bulma-carousel.min.js"></script>
<script src="./static/js/bulma-slider.min.js"></script>
<script src="./static/js/index.js"></script>
</head>
<body>

<nav class="navbar" role="navigation" aria-label="main navigation">
<div class="navbar-brand">
<a role="button" class="navbar-burger" aria-label="menu" aria-expanded="false">
<span aria-hidden="true"></span>
<span aria-hidden="true"></span>
<span aria-hidden="true"></span>
</a>
</div>
<div class="navbar-menu">
<div class="navbar-start" style="flex-grow: 1; justify-content: center;">
<a class="navbar-item" href="https://PointsCoder.github.io">
<span class="icon">
<i class="fas fa-home"></i>
</span>
</a>
</div>

</div>
</nav>

<section class="hero">
<div class="hero-body">
<div class="container is-max-desktop">
<div class="columns is-centered">
<div class="column has-text-centered">
<h1 class="title is-1 publication-title">A Language Agent for Autonomous Driving</h1>
<div class="is-size-5 publication-authors">
<span class="author-block">
<a href="https://PointsCoder.github.io">Jiageng Mao</a><sup>1*</sup>,</span>
<span class="author-block">
<a href="https://jay-ye.github.io/">Junjie Ye</a><sup>1*</sup>,</span>
<span class="author-block">
<a href="https://www.linkedin.com/in/yuxiqian">Yuxi Qian</a><sup>1</sup>,</span>
<span class="author-block">
<a href="https://web.stanford.edu/~pavone/index.html">Marco Pavone</a><sup>2,3</sup>,
</span>
<span class="author-block">
<a href="https://yuewang.xyz">Yue Wang</a><sup>1,3</sup>
</span>
</div>

<div class="is-size-5 publication-authors">
<span class="author-block"><sup>1</sup>University of Southern California,</span>
<span class="author-block"><sup>2</sup>Stanford University,</span>
<span class="author-block"><sup>3</sup>NVIDIA</span>
</div>

<div class="is-size-5 publication-authors">
<span class="author-block">* indicates equal contribution.</span>
</div>

<div class="column has-text-centered">
<div class="publication-links">
<!-- PDF Link. -->
<span class="link-block">
<a href=""
class="external-link button is-normal is-rounded is-dark">
<span class="icon">
<i class="fas fa-file-pdf"></i>
</span>
<span>Paper</span>
</a>
</span>
<span class="link-block">
<a href=""
class="external-link button is-normal is-rounded is-dark">
<span class="icon">
<i class="ai ai-arxiv"></i>
</span>
<span>arXiv</span>
</a>
</span>
<!-- Video Link. -->
<!-- <span class="link-block">
<a href="https://www.youtube.com/watch?v=MrKrnHhk8IA"
class="external-link button is-normal is-rounded is-dark">
<span class="icon">
<i class="fab fa-youtube"></i>
</span>
<span>Video</span>
</a>
</span> -->
<!-- Code Link. -->
<span class="link-block">
<a href=""
class="external-link button is-normal is-rounded is-dark">
<span class="icon">
<i class="fab fa-github"></i>
</span>
<span>Code (coming soon)</span>
</a>
</span>
<!-- Dataset Link. -->
<!-- <span class="link-block">
<a href="https://github.com/google/nerfies/releases/tag/0.1"
class="external-link button is-normal is-rounded is-dark">
<span class="icon">
<i class="far fa-images"></i>
</span>
<span>Data</span>
</a> -->
</div>

</div>
</div>
</div>
</div>
</div>
</section>

<section class="hero teaser">
<div class="container is-max-desktop">
<div class="hero-body">
<img src="static/images/teaser_wide.png" alt="Agent-Driver">
<h2 class="subtitle has-text-centered">
<span class="dnerf">Agent-Driver</span> transforms the conventional perception-prediction-planning framework by introducing LLMs as an agent for autonomous driving.
</h2>
</div>
</div>
</section>

<section class="section">
<div class="container is-max-desktop">
<!-- Abstract. -->
<div class="columns is-centered has-text-centered">
<div class="column is-four-fifths">
<h2 class="title is-3">Abstract</h2>
<div class="content has-text-justified">
<p>
Human-level driving is an essential goal of autonomous driving. Conventional approaches formulate autonomous driving as a perception-prediction-planning framework, yet their systems do not capitalize on the inherent reasoning ability and experiential knowledge of humans. In this paper, we propose a fundamental paradigm shift from current pipelines, exploiting Large Language Models (LLMs) as a cognitive agent to integrate human-like intelligence into autonomous driving systems. Our system, termed Agent-Driver, transforms the traditional autonomous driving pipeline by introducing a versatile tool library accessible via function calls, a cognitive memory of common sense and experiential knowledge for decision-making, and a reasoning engine capable of chain-of-thought reasoning, task planning, motion planning, and self-reflection. Powered by LLMs, our Agent-Driver is endowed with intuitive common sense and robust reasoning capabilities, thus enabling a more nuanced, human-like approach to autonomous driving. We evaluate our system on the large-scale nuScenes benchmark, and extensive experiments substantiate that our Agent-Driver significantly outperforms the state-of-the-art driving methods by a large margin. Our approach also demonstrates superior interpretability and few-shot learning ability to these methods.
</p>
</div>
</div>
</div>
</div>
</section>

<section class="section">
<div class="container is-max-desktop">
<!-- Method. -->
<div class="columns is-centered has-text-centered">
<div class="column is-full_width">
<hr>
<h2 class="title is-3">Method</h2>
<br>
<img src="static/images/method.png" width="900" class="center">
<div class="content has-text-justified">
<br>
<p>
</p><li>
We present Agent-Driver, an LLM-powered agent that revolutionizes the traditional perception-prediction-planning framework, establishing a powerful yet flexible paradigm for human-like autonomous driving.
</li>
<li>
Agent-Driver integrates a tool library for dynamic perception and prediction, a cognitive memory for human knowledge, and a reasoning engine that emulates human decision-making, all orchestrated by LLMs to enable a more anthropomorphic autonomous driving process.
</li>
<li>
Agent-Driver significantly outperforms the state-of-the-art autonomous driving systems by a large margin, with over $30\%$ collision improvements in motion planning. Our approach also demonstrates strong few-shot learning ability and interpretability on the nuScenes benchmark.
</li>
<li>
We provide a variety range of ablation study to dissect the proposed architecture and understand the efficacy of each module, to facilitate future research in this direction.
</li>
<p></p>
</div>
</div>
</div>
</div>
</section>

<section class="section">
<div class="container is-max-desktop">
<!-- Method. -->
<div class="columns is-centered has-text-centered">
<div class="column is-full_width">
<hr>
<h2 class="title is-3">Demos</h2>
<br>
<img src="static/images/viz.png" width="1200" class="center">
<div class="content has-text-justified">

</div>
</div>
</div>
<hr>
</div>
</section>

<section class="section" id="BibTeX">
<div class="container is-max-desktop content">
<h2 class="title">BibTeX</h2>
<pre><code>@article{mao2023agentdriver,
author = {Mao, Jiageng and Ye, Junjie and Qian, Yuxi and Pavone, Marco and Wang, Yue},
title = {A Language Agent for Autonomous Driving},
year = {2023},
}</code></pre>
</div>
</section>


<footer class="footer">
<div class="container">
<div class="columns is-centered">
<div class="column is-8">
<div class="content">
<p>
Website template is borrowed from <a
href="https://github.com/nerfies/nerfies.github.io">nerfies</a>.
</p>
</div>
</div>
</div>
</div>
</footer>

</body>
</html>
1 change: 1 addition & 0 deletions static/css/bulma-carousel.min.css

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading

0 comments on commit 766fc58

Please sign in to comment.