Tesla FSD Technical Deep Dive
A comprehensive guide to Tesla Full Self-Driving.
Table of Contents
Part 1: Architectural Layout
Part 2: Vector Space
- Virtual Camera
- Video Neural Net Architecture
- Recurrent Neural Network
- Tesla Vision Structure
Part 3: Planning and Control
Part 4: Auto Labeling and Simulation
Part 5: Updates
- Occupancy Network
- Training Infrastructure
- Lanes and Objects
- AI Compiler and Inference
- Auto Labeling
- Data Engine
Part 6: Dojo
What is FSD?
Tesla's Full Self-Driving (FSD) is a set of advanced safety and autonomous driving features that are available for Tesla electric vehicles (EVs). It enables Tesla vehicles to drive semi-autonomously (legally, level 2 autonomy). Currently, FSD is not capable of "fully" self-driving, but Tesla's ultimate goal is to achieve level 5 autonomy.
Three tiers of autonomous driving software are available on Tesla EVs: Autopilot, Enhanced Autopilot, and FSD. Autopilot is standard. Enhanced Autopilot is a $6,000 software add-on, and FSD is a $15,000 software add-on. The price of FSD has typically increased over time as the software has improved.
Tesla vehicles have been equipped with Autopilot functionality for years; however, it was generally limited to freeway driving. FSD allows for city street driving and can automatically stop and start at intersections and stop signs.
Autopilot Features
- Traffic-Aware Cruise Control: Matches the speed of your car to that of the surrounding traffic
- Autosteer: Assists in steering within a clearly marked lane, and uses traffic-aware cruise control
Enhanced Autopilot Features
- Autopilot features.
- Navigate on Autopilot: Actively guides your car from a highway’s on-ramp to off-ramp, including suggesting lane changes, navigating interchanges, automatically engaging the turn signal, and taking the correct exit.
- Auto Lane Change: Assists in moving to an adjacent lane on the highway when Autosteer is engaged.
- Autopark: Helps automatically parallel or perpendicular park your car, with a single touch.
- Summon: Moves your car in and out of a tight space using the mobile app or key.
- Smart Summon: Your car will navigate more complex environments and parking spaces, maneuvering around objects as necessary to come find you in a parking lot.
Full Self-Driving Features
- Enhanced autopilot features.
- Traffic and Stop Sign Control: Identifies stop signs and traffic lights and automatically slows your car to a stop on approach, with your active supervision.
- Autosteer on city streets.
How Does FSD Work?
Tesla's AI team developed advanced machine learning techniques to teach Tesla EVs how to drive autonomously.
Generally, machine learning is a branch of applied statistics that focuses on using computers to estimate complex functions, placing less emphasis on the confidence intervals around those estimates. At its core, machine learning is a way for computers to learn from data. But what does it mean for a computer to "learn"? In his textbook Machine Learning, Tom Mitchell defines it as follows: "A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E."
Starting from first principles, if humans can drive using vision alone, then it should be possible for a computer to drive using only cameras. Musk has described human drivers as having two cameras on a gimbal that can only look in one direction at a time. Humans lose vision when they blink or look away, and they are easily distracted. Cameras, on the other hand, run continuously, and with Tesla's 8-camera setup the entire perimeter of the vehicle is constantly monitored. The 8 cameras on a Tesla are 1280 x 960, 12-bit (HDR), at 36 Hz. Unlike most competitors, Tesla forgoes both Lidar and mmWave radar.
The figure below shows how the human and primate cerebral cortex process vision. When the information is received by the retina, it is processed through various areas, streams, and layers of the cerebral cortex, leading to biological vision. These areas and organs include the optic chiasm, the lateral geniculate nucleus, the primary visual cortex, the extrastriate cortex, and the inferior temporal area.
Some of the first learning algorithms that are still recognized today were designed to be computational models of biological learning or models of how the brain learns or could learn. As a result, deep learning has also been referred to as artificial neural networks (ANNs). From this perspective, deep learning models are artificial systems inspired by the biological brain, whether it be the human brain or the brain of another animal. However, it is important to not see deep learning as an attempt to replicate the brain. Modern deep learning is influenced by various fields, particularly applied mathematics concepts like linear algebra, probability, information theory, and numerical optimization.
The video below shows the current version of Tesla Vision. The 8 cameras (Left) around the vehicle generate 3D Vector Space (Right) through neural networks, which represents everything you need for driving, such as lines, edges, curbs, traffic signs, traffic lights, cars; and positions, orientations, depth, velocities of cars.
An ongoing challenge in many real-world applications of artificial intelligence is that the factors affecting the data we observe can be numerous and complex. For example, in an image of a red car taken at night, the individual pixels may appear very dark, and the shape of the car's silhouette can be affected by the angle from which it is viewed. In order to effectively use this data in AI applications, it is necessary to separate out the various factors that are influencing the data and focus on the relevant ones, while discarding or disregarding the others.
Tesla's Full Self-Driving (FSD) neural network is trained using probability-based methods. For instance, when the vehicle's cameras capture a blurry image of a stop sign, the system will automatically compute the probability that the image represents a stop sign. Then, this probability is compared against a predetermined benchmark. For example, if the probability of the image representing a stop sign is above 98%, the vehicle must stop. However, if the computed probability falls under 95%, the vehicle will not stop. If such predictions prove to be incorrect, the human driver will take over, and Tesla is notified of the incorrect prediction and can use the feedback to improve the system. The company can then fix the issue by adjusting the percentage threshold or the calculation process to help the neural network make more accurate predictions.
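The threshold logic described above can be sketched as follows. This is an illustrative toy, not Tesla's actual decision code; the 98% and 95% figures are the article's example numbers, and the behavior in the ambiguous band between them is an assumption.

```python
# Illustrative sketch of probability-threshold decision logic.
# The 98%/95% values are the article's example numbers, not Tesla's.

def should_stop(p_stop_sign: float,
                stop_threshold: float = 0.98,
                ignore_threshold: float = 0.95) -> str:
    """Map a classifier probability to a driving decision."""
    if p_stop_sign >= stop_threshold:
        return "stop"            # confident it is a stop sign
    if p_stop_sign < ignore_threshold:
        return "continue"        # confident it is not
    return "defer_to_driver"     # ambiguous band: human supervision

print(should_stop(0.99))  # stop
print(should_stop(0.90))  # continue
print(should_stop(0.96))  # defer_to_driver
```

Feedback from incorrect predictions would then be used to retune the thresholds or the probability computation itself, as the article describes.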
How Did Tesla Solve This?
In general, deep learning solves the central problem in representation learning by introducing representations that are expressed in terms of other, simpler representations. Deep learning allows the computer to build complex concepts out of simpler concepts.
Interpreting raw sensory input data, such as an image represented by a series of pixel values, can prove to be a challenging task for a computer. The process of mapping the set of pixels to an object identity is highly complex, making it difficult to learn or evaluate through direct means. However, deep learning addresses this challenge by breaking down the complex mapping into a series of simpler mappings, each represented by a different layer in the model. The input is initially introduced at the visible layer, which contains the variables that can be observed, and then a series of hidden layers work to extract increasingly abstract features from the image. These layers are referred to as "hidden" because the model must determine which concepts are useful in understanding the relationships present in the observed data, rather than being provided with this information.
For example, the first hidden layer may be able to identify edges by comparing the brightness of neighboring pixels, the second hidden layer may look for corners and extended contours, and the third hidden layer may detect entire parts of specific objects. Ultimately, by breaking down the image into smaller, understandable concepts, the model is able to recognize the objects present in the image.
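The first-hidden-layer behavior described above can be demonstrated concretely. The sketch below hand-sets a Sobel-like kernel, which is the kind of edge detector that learned first convolutional layers often converge to, and applies it to a tiny synthetic image with a dark/bright boundary.

```python
import numpy as np

# A hand-set Sobel-like kernel standing in for what a learned first
# convolutional layer often converges to: an edge detector that
# compares the brightness of neighboring pixels.

def conv2d(image, kernel):
    kh, kw = kernel.shape
    h, w = image.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return out

# Vertical edge between a dark left half and a bright right half.
image = np.zeros((5, 6))
image[:, 3:] = 1.0

sobel_x = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)

edges = conv2d(image, sobel_x)
print(edges)  # strong response in the columns straddling the boundary
```

Deeper layers would combine such edge responses into corners, contours, and object parts, as the paragraph above explains.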
Tesla uses deep probabilistic models in its AI systems. For instance, an AI system looking at an image of a face with one eye hidden in shadow may initially detect only one eye. After recognizing that a face is present, the system can deduce that a second eye is likely present as well. In this scenario, the concept graph has only two layers: one for eyes and one for faces. However, the computation graph can include 2n layers if the system refines its understanding of each concept given the other n times. The same reasoning applies elsewhere: when a Tesla AI system observes part of a car hidden behind a bush, it can predict that the rest of the car is likely there, obscured by the foliage.
As early as 2013, deep networks had spectacular successes in pedestrian detection and yielded superhuman performance in traffic sign classification.
Lidar vs Vision
The development of self-driving vehicles generally follows one of two main approaches: Lidar or vision. Most other companies use Lidar but Tesla uses vision only. This approach relies primarily on cameras as the 'eyes' of the vehicle, unlike the Lidar approach which in addition to cameras and radar, also uses Lidar to guide the vehicle.
One of the key advantages of utilizing Lidar technology in autonomous vehicles is its ability to provide improved depth perception and localization capabilities. Lidar works by emitting laser beams and creating a point cloud map that measures the distance between the car and its surroundings. This map, in conjunction with camera vision, allows the vehicle to more accurately recognize and understand the spatial relationships between objects in its vicinity. Many companies implementing Lidar also choose to use high-definition (HD) maps to supplement their vehicles' perception systems. These maps can be extremely precise and, when used in conjunction with Lidar, allow the car to precisely determine its location and the layout of the road ahead. However, this technology comes with certain downsides, such as cost and the need for regular updates. While Waymo has been able to reduce the cost of Lidar by 90%, it still remains a costly addition to a vehicle at around $7500 per car. Additionally, HD maps are also quite expensive, with unit prices several times higher than traditional navigation maps and the process of creating them is relatively slow. These cost factors make it difficult to scale Lidar-based autonomous driving technology in the short term as long as the prices of Lidar and HD maps are high.
Tesla believes that Lidar is not essential for safe autonomous driving. Humans drive using vision and do not rely on lasers to detect range; therefore, the argument goes, why should cars require Lidar? Furthermore, the current driving system is designed for drivers with vision, not lasers, and if the vision system is sufficient, there is no need for Lidar. It is also worth noting that Tesla's CEO, Elon Musk, has extensive knowledge of and experience with Lidar through his other company, SpaceX, where his team developed their own Lidar and used it on their rockets. This serves as a further indication of his conviction that Lidar is unnecessary for autonomous driving.
Tesla's approach is based on the belief that in the coming years, its vision system will be advanced enough that other car manufacturers will not be able to justify the cost of including Lidar in their vehicles. While some argue that adding Lidar in addition to cameras would provide an extra layer of safety, the question remains whether consumers will be willing to pay extra for a technology that offers slightly increased safety when the current level of safety is already deemed acceptable. History has shown that people are not willing to pay a premium for "safer" airfare when the standard option is already considered safe. Similarly, if Tesla's vision-based approach proves to be safe enough, Lidar may be considered as a redundant and unnecessary cost.
Currently, it remains uncertain as to when Tesla and other companies utilizing Lidar technology will achieve level 5 autonomy. While both approaches are likely capable of solving the problem of autonomous driving, the question of who will accomplish this goal first remains unresolved. Currently, companies using the Lidar approach, such as Waymo, have already achieved level 4 autonomy in certain locations, while Tesla's current level of autonomy, from a legal perspective, is considered to be level 2. However, some experts have suggested that in terms of technology, Tesla's Full Self-Driving (FSD) beta may be closer to level 3 or even 4.
In August 2021, Tesla gave an in-depth presentation led by Andrej Karpathy, Ashok Elluswamy, and Milan Kovac on how FSD works. It will be covered in Parts 1-4. In September 2022, Tesla gave an update on their FSD progress, which will be the focus of Parts 5 and 6.
To understand Tesla's architectural layout, we need to know a few terms:
- Backbone: the feature-extraction network, which produces rich feature maps from the input image and lets the model recognize several objects in a single image.
- Neck: the intermediate stage that aggregates and refines the backbone's features across scales (for example, a feature pyramid).
- Head: the task-specific output stage that turns those features into predictions such as bounding boxes, classes, or lanes.
In Tesla's architecture, the backbone is a RegNet built from residual (ResNet-style) blocks, the neck is a BiFPN, and the head is a HydraNet.
Neural Network Backbone
Tesla uses regular network structures (RegNet) designed with residual neural network blocks as its neural network backbone. It uses RegNet because it has a nice design space and a good tradeoff between latency and accuracy.
RegNet was proposed by Facebook in a 2020 research paper Designing Network Design Spaces.
How does Tesla recognize low-resolution objects at a distance, like the car in the figure above? They use bi-directional feature pyramid network (BiFPN) to achieve multi-scale feature pyramid fusion.
Multi-scale feature pyramid fusion is used to combine features from different scales or resolutions of an image or other visual input. It typically involves creating a pyramid of features at different scales, where each scale is a filtered version of the original image at a different resolution. The features at each scale are then combined, or fused, to create a multi-scale representation of the image that can be used for object detection and recognition.
BiFPN was proposed in 2019 by Google Research in the paper EfficientDet: Scalable and Efficient Object Detection.
BiFPN is a variation of FPN that includes two main enhancements: it performs fusion from the bottom-up in addition to the top-down fusion, and it utilizes weights for each input feature during the fusion process to account for the fact that input features at different resolutions often contribute unequally to the output feature.
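The weighted-fusion idea can be sketched directly from the EfficientDet paper's "fast normalized fusion" formula. The feature maps and weights below are illustrative stand-ins.

```python
import numpy as np

# Sketch of BiFPN's "fast normalized fusion" from the EfficientDet
# paper: each input feature gets a learned non-negative weight, and
# the output is O = sum(w_i * I_i) / (eps + sum(w_j)).

def fast_normalized_fusion(features, weights, eps=1e-4):
    weights = np.maximum(weights, 0.0)          # ReLU keeps weights >= 0
    norm = weights / (eps + weights.sum())      # weights sum to ~1
    return sum(w * f for w, f in zip(norm, features))

# Two feature maps at the same resolution (e.g. a top-down path output
# and the lateral input), fused with unequal learned weights.
f1 = np.ones((4, 4))
f2 = np.full((4, 4), 3.0)
fused = fast_normalized_fusion([f1, f2], np.array([1.0, 3.0]))
print(fused[0, 0])  # close to (1*1 + 3*3) / 4 = 2.5
```

In the real network the weights are learned during training, so the fusion can discover that, say, a high-resolution input matters more than a low-resolution one for a given output level.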
After the BiFPN layer, the network is connected to a detection head. This detection head is made up of several specialized heads that are tailored to specific tasks. For instance, when the Tesla AI team is tasked with detecting cars, they utilize a one-stage object detector that is similar to the YOLO (You Only Look Once) algorithm. This YOLO approach is based on the idea that you only need to examine an image once in order to predict the presence and location of objects, as outlined in the paper “You Only Look Once: Unified, Real-Time Object Detection.”
When a raster is initialized, a binary bit is assigned to each position to indicate the presence or absence of a car. If a car is detected, a set of additional attributes are also recorded, such as the (x, y) coordinates, the width and height of the bounding box, and the type of car that is detected.
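The raster described above can be modeled as a small array. The grid size and field layout below are invented for illustration; they are not Tesla's actual output format.

```python
import numpy as np

# Sketch of a YOLO-style output raster: a coarse grid where each cell
# carries a presence bit plus box attributes (x, y, w, h) and a class id.
# Grid size and field layout here are illustrative, not Tesla's.

GRID_H, GRID_W = 6, 8
FIELDS = 6  # [presence, x, y, w, h, class_id]

raster = np.zeros((GRID_H, GRID_W, FIELDS))

def record_detection(raster, row, col, x, y, w, h, cls):
    raster[row, col] = [1.0, x, y, w, h, cls]

# A car detected in cell (2, 5) with a normalized box and class 1.
record_detection(raster, 2, 5, x=0.4, y=0.6, w=0.2, h=0.1, cls=1)

present = raster[..., 0] > 0.5
print(int(present.sum()))  # 1 occupied cell
```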
The image above depicts an output where "cls" represents the classification branch and "reg" the bounding-box regression branch, together with a confidence score; the input resolution is 640x480. The regression output consists of 4 values per detection: the (x, y) coordinates and the width and height of the bounding box.
In Tesla FSD, there is a wide range of responsibilities, not limited to just detecting cars. Other tasks that it is designed to handle include recognizing and identifying traffic lights, predicting lanes, and other functions as well. To manage these multiple tasks, Tesla has implemented its own architectural design referred to as HydraNets. This design features a shared central structure (backbone) with various specialized offshoots (heads) that are dedicated to specific tasks.
The HydraNets architecture offers several advantages, including the ability to share features among tasks, which reduces the number of convolution calculations and the number of backbones needed. This is particularly beneficial during testing. The architecture also separates individual tasks from the central backbone, allowing for fine-tuning of each task independently. Additionally, the architecture utilizes a feature caching system during training which reduces computational requirements during fine-tuning by only utilizing cached features to adjust the specialized heads.
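The shared-backbone-with-heads idea, including the feature-caching trick, can be sketched as a toy class. The backbone and head functions below are trivial stand-ins, not Tesla's actual networks.

```python
import numpy as np

# Toy HydraNet layout: one shared backbone feeding several task heads.
# The "cache" mimics the feature-caching trick: the backbone runs once
# and its output is reused by every head.

def backbone(image):
    return image.mean(axis=-1)          # stand-in for RegNet features

def car_head(features):
    return features.max()               # stand-in detection score

def lane_head(features):
    return features.mean()              # stand-in lane confidence

class HydraNet:
    def __init__(self):
        self.heads = {"cars": car_head, "lanes": lane_head}
        self._cache = None

    def forward(self, image):
        self._cache = backbone(image)   # computed once, shared by all heads
        return {name: head(self._cache) for name, head in self.heads.items()}

net = HydraNet()
out = net.forward(np.ones((4, 4, 3)))
print(sorted(out))  # ['cars', 'lanes']
```

During fine-tuning, `self._cache` stands in for the stored multi-scale features: only the heads need to run again, which is where the training savings come from.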
The process for training a HydraNet model includes a combination of end-to-end training, feature caching at the multi-scale level, and fine-tuning specific tasks using the cached features. The process is then repeated with end-to-end training to achieve optimal results. The video below shows predictions from the HydraNet model.
What we've seen so far only works for simple tasks like lane-keeping in autopilot. In Part 2, we'll cover how Tesla uses input from all 8 cameras.
The team at Tesla working on developing the Full Self-Driving feature (FSD) discovered that utilizing only one camera was not sufficient. To improve the system, they needed to add more cameras and convert the perception system's predictions into a 3D format, known as Vector Space. This three-dimensional representation allows for the foundation of the Planning and Control system, and it digitizes the information about the vehicle and its environment, including the vehicle's position, speed, lane, signs, traffic signals, and nearby objects, and then presents them visually in this space.
Tesla created a system called the Occupancy Tracker, written in C++, which stitches together curb detections from images across camera scenes, camera boundaries, and time. However, the design has two limitations. First, the code for fusing information across cameras and for tracking is difficult to write and fine-tune by hand, making it hard to adjust the Occupancy Tracker's settings and parameters. Second, the system does not output results in the right format: predictions should be made in Vector Space rather than Image Space.
As the video above shows, with per-camera detection followed by fusion, each camera produces a good prediction individually, but significant accuracy is lost when the predictions are projected into Vector Space (visible as the red and blue lines in the projection at the bottom of the figure above). This happens because projecting the images correctly requires extremely precise per-pixel depth, which is very difficult to predict. The image-space method (per-camera detection, then fusion) also cannot handle two scenarios: predicting occluded areas, and predicting large objects that span two or more cameras (up to five). These predictions are poor and, if not handled correctly, can even lead to dangerous traffic accidents.
Instead of detecting per camera and fusing the results afterward, a different approach is needed: feed all the images to a single neural network and have it output directly in Vector Space.
The Tesla AI team is working on designing a neural network as depicted on the right side of the above image. The network processes each image by passing it through a backbone and then transforms the image space features into vector space features before passing it to the head for decoding. There are two main challenges with this approach:
- Transforming features from image space to vector space, and
- Making the transformation process differentiable so that the whole network can be trained end to end. Deep networks are trained with gradient-based optimization, so every component in the pipeline must be differentiable, even though the resulting optimization problem is generally non-convex. Making vector-space predictions with this network also requires datasets labeled directly in vector space. More about this problem will be discussed later.
Tesla AI uses a bird's-eye-view method for predictions rather than relying on image space predictions. To illustrate, the image above shows that a single yellow pixel in the output space is derived from the projection of a road edge as detected by the three front cameras on a Tesla vehicle: the main forward camera, the narrow forward camera, and the wide forward camera. The accuracy of this projection is affected by the road surface's geometry, and if the point of interest is obscured, it may be necessary to look at other areas. It is challenging to achieve a high degree of accuracy and to establish a fixed transformation for this component. Tesla uses a transformer to increase accuracy.
The Transformer is a deep learning model that has received enormous attention in recent years. It was introduced in the Google paper Attention Is All You Need and was originally applied in natural language processing (NLP). Its core is the attention mechanism, upon which BERT, GPT, and other models are built; this attention-based approach is now commonplace in NLP, computer vision (CV), and other areas of AI.
The Transformer consists of an encoder-decoder structure. Both the encoder and decoder are stacks of identical layers: each encoder layer contains a Multi-Head Attention sublayer and a Feed-Forward sublayer, while each decoder layer contains two Multi-Head Attention sublayers and a Feed-Forward sublayer.
Attention is the ability to map a query, along with a set of keys and values, to an output, where all elements are represented as vectors. The output is computed as a weighted sum of values, with the weight assigned to each value being determined by the compatibility function between the query and the corresponding key.
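The weighted-sum definition above corresponds to scaled dot-product attention, Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V, which can be written out in a few lines. The dimensions below are arbitrary.

```python
import numpy as np

# Scaled dot-product attention as defined in "Attention Is All You Need":
# Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V.

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)     # query/key compatibility
    weights = softmax(scores, axis=-1)  # one weight per value
    return weights @ V                  # weighted sum of values

rng = np.random.default_rng(0)
Q = rng.standard_normal((2, 4))         # 2 queries, d_k = 4
K = rng.standard_normal((3, 4))         # 3 keys
V = rng.standard_normal((3, 4))         # 3 values

out = attention(Q, K, V)
print(out.shape)  # (2, 4)
```

Multi-Head Attention simply runs several of these in parallel on learned projections of Q, K, and V and concatenates the results.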
There are a number of types of Attention, such as Soft Attention, Hard Attention, Self Attention, Dot-Product Attention, Single Attention, and Multi-Head Attention. In Attention Is All You Need, the Attention mechanism is composed of Multi-Head Attention with Scaled Dot-Product Attention.
Transformer is not just used in language translation. Tesla has found a way to use it to "translate" image space into vector space.
Transformer For Self-Driving
The process of training an Image-to-BEV Transformer begins with the creation of an Output Space Raster, which is of the same size as the target output. Then, each point on the Output Space Raster is Position Encoded, as are all of the images and their features, before being fed to an Encoder. This Encoder is made up of a stack of Multi-head self-attention that produces an encoded representation of the Init Vector Space raster. The target BEV is Position Encoded and fed to the Decoder, which, along with the Encoder stack’s encoded representation, produces an encoded representation of the target BEV. The Output layer then converts this into BEV features and the output Vector Space (BEV). Finally, the Transformer’s loss function compares this output sequence with the target sequence from the training data, and the gradients generated from this loss are used to train the Transformer during back-propagation.
Tesla has demonstrated the effectiveness of the process of utilizing a Transformer to transform from Image Space to Vector Space. This process can be described by the following steps: Initialize an output space raster, then positional encode the points on the raster. These points are then encoded by a Multi-Layer Perceptron (MLP) into a set of query vectors; for instance, the yellow point. Each image from the eight cameras emits its own keys and values. The keys and queries interact multiplicatively (dot-product attention in Transformer) with the Multi-Cam image feature library and the result is output to the vector space. To put it simply, the Transformer network is asked to identify features of a certain type at a particular pixel (the yellow point) in the output space (vector space). Among the eight cameras, three respond that the position is a road edge. After additional processing, the network finally outputs a road edge to that position in the Vector Space. Every pixel on the initial Vector Space Raster is processed in this way, and the transformed pixels form a complete Vector Space puzzle.
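The query/key/value interaction described above can be sketched as cross-attention from BEV raster cells to camera features. Everything here, including the dimensions and the random "camera features", is illustrative; Tesla's real network is far larger and its projections are learned.

```python
import numpy as np

# Sketch of the image-to-BEV transformation: each BEV raster cell emits
# a query, each camera's features emit keys and values, and dot-product
# attention pulls image features into vector space.

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(1)
d = 8
n_cameras, tokens_per_cam = 8, 16
bev_cells = 4                                  # tiny 2x2 BEV raster

cam_feats = rng.standard_normal((n_cameras * tokens_per_cam, d))
W_q, W_k, W_v = (rng.standard_normal((d, d)) for _ in range(3))

queries = rng.standard_normal((bev_cells, d)) @ W_q   # one per BEV cell
keys = cam_feats @ W_k                                # from all 8 cameras
values = cam_feats @ W_v

weights = softmax(queries @ keys.T / np.sqrt(d))
bev_features = weights @ values                # landed in vector space
print(bev_features.shape)  # (4, 8)
```

Each row of `weights` shows how strongly one BEV cell attends to each camera token, which is the mechanism by which "three cameras respond that the position is a road edge."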
Because Tesla's 8 cameras differ in focal length, angle of view, depth of field, and mounting position, the same object appears differently to each camera, so the raw data cannot be used directly for training. Before training, the data from the 8 cameras therefore needs to be standardized into one virtual, synthetic camera.
A new layer is added directly above the image rectification layer which serves the purpose of camera calibration. This layer enables the transformation of all images into a virtual common camera, thus making the previously blurred images clear and improving the performance significantly.
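A minimal pinhole-camera sketch of this rectification idea: lift a pixel from a physical camera through its intrinsic matrix into a viewing ray, then reproject through the shared virtual camera's intrinsics. All intrinsic values below are made up; real rectification also handles lens distortion and extrinsics.

```python
import numpy as np

# Minimal pinhole-rectification sketch: lift a pixel from a physical
# camera through its intrinsics K into a viewing ray, then reproject
# through the shared virtual camera's intrinsics K_virtual.

def rectify_pixel(pixel, K_cam, K_virtual):
    ray = np.linalg.inv(K_cam) @ np.array([pixel[0], pixel[1], 1.0])
    projected = K_virtual @ ray
    return projected[:2] / projected[2]

K_cam = np.array([[800.0, 0, 640], [0, 800.0, 480], [0, 0, 1]])
K_virtual = np.array([[1000.0, 0, 640], [0, 1000.0, 480], [0, 0, 1]])

# The principal point maps to itself; off-center pixels are rescaled.
print(rectify_pixel((640, 480), K_cam, K_virtual))  # [640. 480.]
print(rectify_pixel((720, 480), K_cam, K_virtual))  # [740. 480.]
```

Applying a map like this to every pixel warps each physical camera's image into the common virtual camera frame before the backbone sees it.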
The image above shows the results of the neural net, which have significantly improved. The multi-camera network predicts directly in vector space and detects objects better, even when only a small portion of a car is visible or when cars cross camera boundaries in tight spaces.
Video Neural Net Architecture
A Vector Space network can provide us with the necessary output for autonomous driving, but taking into account a fourth dimension – time – is essential. This time-based perspective is required to predict the behavior of objects such as cars, determine their speed, identify whether they are parked or moving, and even remember the driving context in some cases.
The Tesla AI team is attempting to incorporate two components into the neural network setup: a feature queue module for caching certain features over a certain period and a video module for integrating this information over time. They are also supplying the network with kinematics, or velocity and acceleration from an Inertial Measurement Unit (IMU), alongside the data from eight cameras. This particular version of Tesla AI only takes input from eight cameras and IMU.
The layout of the Feature Queue is illustrated above: three streams - Ego Kinematics, Multi-Cam Features, and Positional Encodings - are combined, encoded, and stored. The queue is then consumed by the video module. This data structure functions as a First-In-First-Out (FIFO) queue whose push and pop mechanisms are essential to Tesla's use case. Two push policies are used: a Time-Based Queue, which stores information on a time schedule, and a Space-Based Queue, which stores information based on distance traveled.
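The two push policies can be sketched as small wrappers around a FIFO queue. The capacity, interval, and stride values below are invented for illustration.

```python
from collections import deque

# Sketch of the two push policies: a time-based queue pushes a feature
# snapshot every fixed interval, a space-based queue every fixed
# distance traveled. Capacity and thresholds are made up.

class FeatureQueue:
    def __init__(self, maxlen=20):
        self.queue = deque(maxlen=maxlen)   # FIFO: oldest features fall off

    def push(self, features):
        self.queue.append(features)

class TimeBasedPush:
    def __init__(self, queue, interval_ms=27):
        self.queue, self.interval, self.last = queue, interval_ms, -10**9

    def tick(self, t_ms, features):
        if t_ms - self.last >= self.interval:
            self.queue.push(features)
            self.last = t_ms

class SpaceBasedPush:
    def __init__(self, queue, stride_m=1.0):
        self.queue, self.stride, self.last = queue, stride_m, -10**9

    def tick(self, traveled_m, features):
        if traveled_m - self.last >= self.stride:
            self.queue.push(features)
            self.last = traveled_m

q = FeatureQueue()
timed = TimeBasedPush(q)
for t in range(0, 100, 9):        # camera frames arriving every ~9 ms
    timed.tick(t, features={"t": t})
print(len(q.queue))  # only snapshots spaced >= 27 ms apart are kept
```

The space-based variant matters when time stops being informative, e.g. while waiting at a red light: pushing per meter traveled keeps lane geometry in memory without flooding the queue with identical frames.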
In the image above, cars approach the intersection. As other vehicles advance, they can temporarily occlude the vehicles in front, causing those cars to stop and wait their turn. To solve this problem, a Time-Based Queue is used, where features are pushed into the queue roughly every 27 milliseconds. This follows from the Tesla camera spec of 1280x960 @ 36 Hz: each frame arrives every 1/36 ≈ 0.028 seconds, or about 27-28 milliseconds. Because the neural network can look back and reference past memories, a detection can still be made even when a car is partially occluded.
The Time-Based Queue stores feature information while a car is waiting at a red light, while the Space-Based Queue logs feature information every time the car moves forward a fixed distance. This allows the Tesla AI to predict what lane the car is in and what lane the car next to it is in.
When it comes to the video module, there are a variety of options for fusing temporal information, such as 3D-Convolutions, Transformers, Axial Transformers, Recurrent Neural Networks, and Spatial RNNs. Of these, the Tesla AI group has a particular fondness for Spatial Recurrent Neural Networks.
Recurrent Neural Network
To understand Spatial Recurrent Neural Networks, we first need to understand Recurrent Neural Networks (RNNs), Long Short-Term Memory (LSTM), and the Gated Recurrent Unit (GRU). RNNs are a type of artificial neural network for sequential data, commonly used in natural language processing (NLP). Their loop structure grants them memory, but that memory is effectively short-term: as the number of RNN steps grows, the vanishing gradient problem prevents the network from retaining "long-distance" information. GRUs were developed as a modification of the RNN hidden layer that captures long-range connections and mitigates the vanishing gradient issue. LSTM is another gated unit that combines long-term and short-term memory and is more expressive than the GRU. GRUs and LSTMs are both gated variants of RNNs, with the GRU often viewed as a simplified LSTM.
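A GRU cell can be written out in a few lines following the standard formulation. The dimensions and random weights below are arbitrary; this is the generic textbook cell, not Tesla's implementation.

```python
import numpy as np

# Minimal GRU cell in numpy, following the standard formulation:
#   z  = sigmoid(W_z [h, x])        update gate
#   r  = sigmoid(W_r [h, x])        reset gate
#   h~ = tanh(W_h [r * h, x])       candidate state
#   h' = (1 - z) * h + z * h~

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_cell(x, h, W_z, W_r, W_h):
    hx = np.concatenate([h, x])
    z = sigmoid(W_z @ hx)
    r = sigmoid(W_r @ hx)
    h_tilde = np.tanh(W_h @ np.concatenate([r * h, x]))
    return (1.0 - z) * h + z * h_tilde

rng = np.random.default_rng(0)
d_in, d_h = 3, 4
W_z, W_r, W_h = (rng.standard_normal((d_h, d_h + d_in)) * 0.1
                 for _ in range(3))

h = np.zeros(d_h)
for x in rng.standard_normal((5, d_in)):    # run 5 timesteps
    h = gru_cell(x, h, W_z, W_r, W_h)
print(h.shape)  # (4,)
```

The update gate z decides how much of the old state to keep, which is exactly the mechanism that lets the GRU carry information over long distances.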
Spatial Recurrent Neural Network
In order for the network to possess a long-term memory, the use of Recurrent Neural Networks (RNNs) is necessary. From the image above, we can observe that Spatial RNN has a GRU unit structure. GRU has a more modest number of parameters and faster convergence speed, making it more efficient for the limited computing capabilities of the onboard chip. As a result, the Tesla AI team has opted for the simpler GRU instead of LSTM or other complex structures.
In particular, for the Tesla self-driving mechanism, the two-dimensional surface is being navigated. The Tesla AI team has thus arranged the hidden states in a two-dimensional lattice. When the car is being driven, the network only updates the parts close to the car and in the car's line of sight. Kinematics is used to integrate the car's position in the hidden features grid, with the RNN only being updated at points close by.
Every lattice has an RNN network and the red rectangle symbolizes the ego car, while the white rectangle denotes the features which are in a certain range around the ego car. As the ego car travels from A to B, the feature box also moves. At this stage, we only need to modify the RNNs in the yellow boxes that are covered by the feature box. Spatial RNN has been proven to be highly efficient, as you can see in the video below.
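The spatially selective update described above can be sketched on a small grid. The grid size, window size, and the blending rule standing in for the GRU step are all invented for illustration.

```python
import numpy as np

# Sketch of the spatially selective update: hidden states live on a 2D
# lattice, and only the cells within a window around the ego car's
# current position are updated as it drives.

GRID, WINDOW, D = 32, 5, 4
hidden = np.zeros((GRID, GRID, D))

def update_near_ego(hidden, ego_row, ego_col, features, radius=WINDOW // 2):
    r0, r1 = max(0, ego_row - radius), min(GRID, ego_row + radius + 1)
    c0, c1 = max(0, ego_col - radius), min(GRID, ego_col + radius + 1)
    # Stand-in for a GRU step: blend new features into the local cells.
    hidden[r0:r1, c0:c1] = 0.5 * hidden[r0:r1, c0:c1] + 0.5 * features
    return hidden

# Ego car drives along a row; each step touches only a 5x5 patch.
for col in range(10, 14):
    hidden = update_near_ego(hidden, ego_row=16, ego_col=col,
                             features=np.ones(D))

touched = np.count_nonzero(hidden[..., 0])
print(touched)  # far fewer than the 32*32 cells in the grid
```

This locality is what makes the Spatial RNN efficient: most of the lattice is untouched on any given step, and kinematics determines which patch to update next.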
The following visualization showcases various channels within the hidden state of a Spatial RNN. Observing the 15 channels, one can discern a variety of features such as the center of the road, the edges, the lines, and the road surface. This representation illustrates what the hidden state of the RNN looks like following optimization and training of the neural network. It is clear that certain channels are specifically tracking different characteristics of the road, such as its center, edges, lines, and surface.
This example illustrates the average of the first 10 channels within the hidden state for different intersection traversals. The RNN is able to continuously monitor the current situation, and the neural network has the capacity to selectively read and write to this memory. For instance, when a car is present next to the vehicle and obstructing certain portions of the road, the network can choose not to record information in those areas. Conversely, when the car moves away and the view improves, the RNN can choose to document information about that section of the space. The result is a comprehensive understanding of the driving scenario, with no missing information due to temporary occlusions, leading to accurate operation.
The following visualization presents a few predictions from a single clip using a Spatial RNN. However, this process can be repeated multiple times, using a variety of clips from multiple cars to construct an HD map. This map will be utilized in later stages of the Auto Labeling process.
The Spatial RNN offers several benefits, including:
- Improved Robustness to Temporary Occlusion: As demonstrated in the example, when two cars pass by and briefly obstruct one another, the single frame network loses detection, while the video module is able to retain the memory. Furthermore, when the cars are only partially obscured, the single frame network produces a poor prediction.
- Improved Depth and Velocity From Video Architecture: The Spatial RNN demonstrates significant improvements in its ability to estimate depth and velocity, as seen in the comparison between the radar depth and velocity in green, single frame performance in orange, and video modules performance in blue. Additionally, Tesla started deliveries of Model 3 and Model Y vehicles built for the North American market without radar equipped.
Tesla Vision Structure
The raw images are inputted at the bottom and pass through a rectification layer that adjusts for camera calibration and aligns everything to a common virtual camera. They are then processed by RegNet residual networks, which extract features at multiple scales, and the multi-scale information is fused by a BiFPN. The image representation then passes through a transformer module that converts it into the vector-space output. Next it is fed into a feature queue, indexed by time or space, which is processed by a video module such as the Spatial RNN. Finally, the processed features flow through the branching structure of the HydraNet, with trunks and heads for the various tasks. At present, most other EV manufacturers' vision systems are still in the early stages of this architecture, implementing only the first and second steps.
Planning and Control
The vision network utilizes dense video data, compressing it into a 3D Vector Space representation. The planner's role is then to use this Vector Space to guide the vehicle to its destination while prioritizing safety, comfort, and efficiency. As any experienced driver knows, balancing these three goals is crucial for successful driving.
The early version of Tesla Autopilot, prior to being rebranded as Full Self-Driving (FSD), already demonstrated strong performance on highway driving scenes. It was able to maintain lanes, make necessary lane changes, and take exits off the highway. However, the FSD system aims to extend this level of performance to city street driving as well, which is a more complex task.
City driving poses a number of challenges, particularly with regard to the Action Space, which is a non-convex and high-dimensional problem. This requires the planner to be able to navigate through multiple options and make real-time decisions in a dynamic and complex environment.
When a problem is non-convex, there are multiple solutions that may be satisfactory, but it can be challenging to identify one that is globally consistent, because the planning algorithm can become stuck in any of several local minima.
The complexity of the car's motion planning arises due to the need to predict the position, velocity, and acceleration of the vehicle over a time period of 10 to 15 seconds. This generates a large number of parameters to be considered in real-time. To handle this complexity, two common methods are typically employed: discrete search and continuous function optimization. Discrete search methods are effective for tackling non-convex problems as they do not get trapped in local minima. However, they are not well suited for high-dimensional problems as they do not utilize gradient information and instead must explore each point individually, making them inefficient.
On the other hand, continuous function optimization excels at handling high-dimensional problems by using gradient-based methods to quickly converge on a solution. However, when faced with non-convex problems, these methods can become stuck in local minima and produce suboptimal solutions. The Tesla AI team has developed a hierarchical approach to address this issue, first using a coarse search method to simplify the non-convexity and create a convex corridor, and then utilizing continuous optimization techniques to generate a smooth final trajectory. Two examples of the processing of the Tesla Hybrid Planning System are shown in the following scenarios.
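The coarse-then-fine idea can be sketched in one dimension (a toy illustration, not Tesla's planner, with an invented cost function): a coarse grid search cannot get trapped, so it lands in the globally better basin of a non-convex cost, and gradient descent then polishes that answer into a smooth solution.

```python
def cost(x):
    # A non-convex cost with two basins; the better minimum is near x = -1.
    return (x * x - 1.0) ** 2 + 0.3 * x

def grad(x, eps=1e-6):
    # Numerical gradient, good enough for a toy example.
    return (cost(x + eps) - cost(x - eps)) / (2 * eps)

# Step 1: coarse discrete search over candidate offsets (-2.0 .. 2.0).
# Exhaustive sampling cannot get stuck in the wrong basin.
candidates = [i / 4.0 for i in range(-8, 9)]
x = min(candidates, key=cost)

# Step 2: continuous gradient descent refines the coarse answer into a
# smooth final solution inside that basin.
for _ in range(200):
    x -= 0.05 * grad(x)

print(round(x, 2))  # settles in the global basin near x = -1, not near +1
```

Gradient descent started from a random point could easily converge to the poorer minimum near +1; the discrete stage is what rules that out.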
In this scenario, the vehicle needs to perform two consecutive lane changes in order to make a left turn up ahead. Initially, it considers a lane change that is close by, but this results in a harsh braking experience for the passengers. The second option the vehicle considers is a lane change that is executed later, resulting in the vehicle accelerating and passing behind and in front of other cars. While this may be a viable lane change option, it risks missing the left turn. To arrive at a final decision, the planner repeatedly considers thousands of different possibilities in a very short period of time. Specifically, it performs 2500 searches in just 1.5ms. This search speed is incredibly fast; at a speed of 60km/h, 1.5ms corresponds to only 2.5cm of distance traveled. Ultimately, the planner selects the best option based on a balance of safety, comfort, and the ability to make the turn smoothly.
After carefully considering all available options, the car selects a specific trajectory. As the car follows this path, it closely aligns with the initial plan: the image shows the car's actual velocity (in cyan) on the right, with the original plan (in white) depicted underneath. The plan accurately anticipated the vehicle's movements over the following 10 seconds.
When navigating a city's narrow streets, it is crucial to not only plan for our own car's movement but also to take into account the actions of all other vehicles and optimize for the overall flow of traffic. To accomplish this, the Tesla AI team runs the autopilot planner on all relevant objects within the environment.
As demonstrated in the above scenario, when driving on a narrow road, it's essential for the Autopilot system to not only consider its own actions, but also the actions of other vehicles in order to optimize for overall traffic flow. In this example, oncoming car #1 arrives first, and Autopilot slows down slightly but quickly realizes that there is not enough space on its side to actively avoid the oncoming car. Instead, it determines that the other car will likely yield to the Autopilot vehicle, and so it assertively proceeds. A second oncoming car #2 then arrives with a higher velocity. In this situation, the Tesla AI team runs the autopilot planner for this other vehicle. The prediction results show that there is a high probability that the oncoming car will go around other parked cars (red path), but a low probability that they will yield (green path). Based on these predictions, Autopilot decides to pull over. However, as the Autopilot is pulling over, it observes that the oncoming car is actually yielding based on its yaw rate and acceleration, and so the Autopilot immediately changes its decision and continues to make progress. Because it is impossible to know the exact actions of other participants in the scene, the Autopilot system must plan for every object and ultimately optimize for a safe, smooth, and fast path within the corridor.
The objective is for the car (depicted in blue) to successfully navigate to and park in the designated green parking spot while avoiding obstacles such as curbs, parked cars, and orange cones.
The baseline approach employs the A* algorithm with Euclidean distance as its heuristic function. As the image shows, this method quickly becomes stuck in local minima; although it eventually reaches the goal, it consumes excessive computational resources, expanding almost 400,000 nodes to find the solution.
Incorporating a navigation route improves the baseline performance, but when facing obstacles, the approach is similar to before, retracing steps and exploring new paths. However, even with this enhancement it still required 22,000 nodes to find the solution.
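To make the failure mode concrete, here is a minimal grid A* with the Euclidean heuristic (an illustrative sketch on an invented map, not the actual planner). A wall with a single gap pulls the straight-line heuristic into a dead end first, a tiny version of the local minimum described above, and the expanded-node count is the cost metric the text compares.

```python
import heapq, math

def astar(start, goal, blocked, size=10):
    h = lambda p: math.dist(p, goal)          # Euclidean heuristic
    frontier = [(h(start), 0, start)]
    best_g = {start: 0}
    expanded = 0
    while frontier:
        f, g, p = heapq.heappop(frontier)
        if g > best_g.get(p, float("inf")):
            continue                          # stale queue entry
        expanded += 1
        if p == goal:
            return g, expanded
        x, y = p
        for q in [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]:
            if 0 <= q[0] < size and 0 <= q[1] < size and q not in blocked:
                if g + 1 < best_g.get(q, float("inf")):
                    best_g[q] = g + 1
                    heapq.heappush(frontier, (g + 1 + h(q), g + 1, q))
    return None

wall = {(5, y) for y in range(9)}             # vertical wall, gap only at (5, 9)
cost, expanded = astar((0, 5), (9, 5), wall)
print(cost, expanded)                         # optimal path costs 17 steps
```

A* still finds the optimal 17-step detour through the gap, but it wastes many expansions probing the dead end, and at real-world scale that waste is what the 400,000-node figure reflects.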
Monte Carlo Tree Search
The Tesla AI team is developing neural networks that are able to generate state and action distributions, which can then be integrated into a Monte Carlo tree search algorithm, with different cost functions considered. These cost functions may include specific metrics such as distance, potential collisions, passenger comfort, traversal time, and any manual interventions made by the driver.
As can be seen in the image, the planner is able to quickly progress toward the goal without the use of a navigation heuristic. The neural network is able to take in the overall context of the scene and generate a value function that effectively guides it toward the optimal solution, rather than getting trapped in any local minima. This results in a significant reduction in computational resources, with the solution found using only 288 nodes.
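A value-guided search can be sketched on the same kind of toy map (our construction, with a hand-written value function standing in for the trained network that would supply this guidance in the real system): because the value function encodes where the opening is, the search walks almost straight to the goal and expands only a handful of nodes.

```python
import heapq, math

GOAL, GAP = (9, 5), (5, 9)

def value(p):
    """Stand-in for a learned value function: it 'knows' the wall's gap
    is at the top, so it scores states by distance-to-go through it."""
    if p[0] < 5:
        return math.dist(p, GAP) + math.dist(GAP, GOAL)
    return math.dist(p, GOAL)

def guided_search(start, blocked, size=10):
    frontier = [(value(start), start)]
    seen, came, expanded = {start}, {}, 0
    while frontier:
        _, p = heapq.heappop(frontier)
        expanded += 1
        if p == GOAL:
            steps = 0
            while p in came:                  # walk back to count path length
                p = came[p]
                steps += 1
            return steps, expanded
        x, y = p
        for q in [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]:
            if (0 <= q[0] < size and 0 <= q[1] < size
                    and q not in blocked and q not in seen):
                seen.add(q)
                came[q] = p
                heapq.heappush(frontier, (value(q), q))
    return None

wall = {(5, y) for y in range(9)}             # same wall, gap only at (5, 9)
steps, expanded = guided_search((0, 5), wall)
print(steps, expanded)                        # same 17-step path, far fewer expansions
```

The miniature mirrors the 288-vs-400,000 comparison above: a value function that understands the scene replaces brute-force exploration.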
Tesla has impressive engineering capabilities: the team has applied techniques from AlphaZero and MuZero to its autonomous driving system.
The Final Architecture
The vision system processes the high volume of video data by converting it into a vector space, which is consumed by both the explicit planner and the neural network planner. The network planner can also take in intermediate features generated by the vision network. This combination of inputs produces a trajectory distribution, which can be optimized end-to-end against explicit cost functions, human interventions, and other imitation data. The output is then fed into an explicit planning function, which determines the final steering and acceleration commands for the car.
Auto Labeling and Simulation
How does Tesla create training data?
- Manual labeling
- Auto labeling
Tesla has a 1,000-person in-house data labeling team. Tesla gets training data from its fleet and needs to collect millions of vector space examples that are well-labeled and include various edge cases. 2D image labeling became 3D and 4D labeling.
Labeling is done directly in vector space: the labeler edits the labels there, and the changes are automatically projected into the camera images. People are better at labeling semantics, while computers are better at geometry, reconstruction, triangulation, and tracking. Auto labeling combines the strengths of both.
Auto labeling is far more efficient than manual labeling, so Tesla developed a large system to implement it.
A clip is the smallest unit that Tesla uses for data labeling. It contains video, Inertial Measurement Unit (IMU), GPS, odometry, and other data.
As shown in the image below, Tesla collects clips from engineering cars and customer cars. Then they use auto-labeling AI and manual labeling to train the neural networks.
The 8 cameras on a Tesla EV provide images of objects from different angles, which can be used for NeRF (Neural Radiance Fields), a technique for constructing 3D models from 2D images. Tesla uses a NeRF-like network: they input the coordinates of points on the ground, and the network outputs a prediction of the road height along with curb, lane, and other relevant data. Millions of points can be constructed for each example.
They use cross-entropy loss to ensure accuracy. Once optimization is complete, they have a 3D reconstruction of the road that a computer can understand.
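The query-and-score loop can be sketched as follows; the function shapes and weights below are invented for illustration (the real model is a trained network and the targets come from the reconstruction), but the interface is the same: a ground coordinate goes in, a height and per-point semantics come out, and a cross-entropy loss scores the semantic predictions.

```python
import math

def road_model(x, y):
    """Toy implicit surface: maps a ground coordinate to a height and a
    lane-line probability. Stand-in for the trained NeRF-like network."""
    height = 0.02 * x + 0.01 * y             # gentle invented slope
    p_lane = 1 / (1 + math.exp(-(y - 2)))    # probability this point is on a lane line
    return height, p_lane

def bce(p, label):
    """Binary cross-entropy between a predicted probability and a 0/1 label."""
    return -(label * math.log(p) + (1 - label) * math.log(1 - p))

# Query points, compare against auto labels, back-propagate the loss.
# Here we just score two hand-picked points against their labels.
_, p1 = road_model(0.0, 5.0)    # well past y = 2: labeled as lane (1)
_, p2 = road_model(0.0, -1.0)   # well before y = 2: labeled as not-lane (0)
print(round(bce(p1, 1), 3), round(bce(p2, 0), 3))  # both losses are small
```

Millions of such point queries per scene, each with a small loss, are what optimization drives down to produce the machine-readable 3D road reconstruction.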
The image above shows how different cars driving in the same area are used to collect data points on roads in a particular area.
All of the collective data from customer cars are used to create a map of a particular area. This can be thought of as Tesla's "HD map." Useful clips are extracted and auto-labeled.
The system is also able to reconstruct 3D obstacles that are not moving. The point cloud is able to generate points on walls and roads, even if they have no texture. This is an advantage over Lidar, as its point cloud often removes points from objects with no texture. Without that data, it's difficult to label certain objects.
When a clip is added to Tesla's auto-labeling system, it automatically labels static objects and reconstructs them in a 3D scene. Dynamic objects are also automatically labeled and their position, velocity, acceleration, and posture are placed into the 3D vector space.
This processing is done offline, which lets the system use both past and future information. A track is an object's trajectory through time, so the computer can account for objects (like pedestrians) that need to be tracked but are temporarily obscured by a car. The result is a fully annotated scene even when some objects are hidden in certain frames.
Tesla removed the radar from Tesla EVs within 3 months using the auto labeling system. They are confident that only vision (8 cameras) is necessary for level 5 autonomous driving.
In the image above, snow is covering the camera lens, severely obstructing vision. When Tesla first removed radar, the system was not good at handling low-visibility situations like this. That's because the model had not been trained for these situations. To solve this problem, Tesla requested similar situations from the fleet. They got back lots of clips that depict similar situations.
The fleet sent 10,000 clips of similar situations to Tesla. The clips were auto-labeled in a week. With manual labeling, that would have taken 2 or 3 months.
Then Tesla did the same for 200 other low-visibility situations. The model quickly improved and Tesla was able to remove the radar from Tesla EVs.
Simulation involves using 3D-rendered graphics to train the neural net in situations that are difficult to obtain from the real world.
The simulation in the clip appears highly realistic and visually appealing, as the ground features various cracks. Additionally, all the objects and their movements depicted in the scene closely resemble those of a real-life setting. It would be difficult to differentiate the virtual clip from a real one.
Using simulated data offers several benefits:
- Vector-space data with precise labels, including vehicle cuboids with kinematics, depth, surface normals, and segmentation – useful where manual labeling is challenging, such as scenes with many pedestrians crossing the road.
- New labels can be added quickly through code.
- Simulation can supply data that is difficult to source from the real world, such as rare or hard-to-find scenes.
- The simulation system allows closed-loop behaviors – positioning, perception, prediction, planning, and control – to be tested and verified.
Accurate Sensor Simulation
Tesla's simulation system aims to generate images that closely resemble those captured by the onboard cameras rather than creating visually stunning scenes. To achieve this, the system models a wide range of properties that are found in real-world cameras, including sensor noise, motion blur, optical distortions, and diffraction patterns caused by the windshield. This simulation system is utilized not only to improve the performance of Tesla's Autopilot software but also in the design and optimization of hardware components such as lenses, cameras, sensor placement, and even headlight transmission properties.
In order to create realistic visuals, it is important to minimize the presence of jagged edges. To achieve this, Tesla's simulation system employs neural rendering techniques and utilizes a specialized anti-aliasing algorithm. This algorithm was developed by the Tesla AI team to specifically address this issue. Additionally, ray tracing technology is used to generate realistic lighting and global illumination. These approaches are commonly used in the gaming industry and are employed by Tesla's simulation system to maximize realism in simulated scenes.
Diverse Actors and Locations
To ensure that the AI models developed by the Tesla team do not overly conform to the training data, the team has created a vast library of assets, including thousands of unique vehicles, pedestrians in various attire, props, and even a moose. The team also designed a simulation of 2000 miles of roadways, which is roughly equivalent to the distance between the East and West coasts of the US. To maximize efficiency, they have also developed tools that allow for the construction of several miles of roadway in a single day by a single artist.
Scalable Scenario Generation
The majority of the data used to train Tesla's AI models are generated through algorithms rather than manually by artists. These algorithms are used to create simulation scenarios such as the curvature of the road, various types of trees, cones, poles, cars moving at different speeds, and various types of traffic participants that can be adjusted as needed. The simulation system can also simulate a wide range of natural conditions such as weather and lighting.
The Tesla AI team does not select data arbitrarily; instead, they use machine-learning-based techniques to identify the network's failure points and generate additional data specifically around those areas. The team also uses Generative Adversarial Networks (GANs) in the simulation system, a form of unsupervised learning in which two neural networks are trained in a closed loop, improving performance by competing against each other in a 'game'.
The Tesla AI team aims to recreate any failures that occur with the Autopilot system, as it makes it easy to reproduce the problem within the simulation and identify a solution. To achieve this, the team has developed a pipeline that can replicate scenarios and environments from any location where a Tesla vehicle has been driven. As depicted in the figure, the pipeline starts by collecting real-world footage from a car, which is then processed through an automated labeling system to create a 3D reconstruction of the scene, including all moving objects. With the visual information from the real world, the team then creates a synthetic recreation of the scene and can replay the Autopilot's actions on it. This allows them to identify and solve the problem. Furthermore, they can also generate new scenarios based on the original failure, and continue to train the Autopilot on these new scenarios.
Tesla utilizes neural rendering techniques to increase the realism of its simulations. The latest rendering results demonstrate this, as the recreated synthetic simulation scene appears incredibly realistic when compared to the original clip.
Tesla has used a large amount of simulated data to train its AI models, specifically 371 million images with 480 million labels.
This part of the article is based on Tesla's AI Day from September 30, 2022.
Ashok Elluswamy, the director of Tesla AI, opened the presentation and gave us an update on how rapidly FSD is growing. FSD beta went from 2,000 users in 2021 to 160,000 in 2022. Tesla scaled up its training infrastructure by 40-50% – they now have about 14,000 GPUs.
Tesla also stated that it is removing ultrasonic sensors from its Model 3 and Model Y vehicles and will now only have Tesla vision – 8 cameras. The rollout will be completed on all Model 3 and Model Y vehicles globally over the next few months, followed by Model S and Model X vehicles throughout 2023. The new system will give Autopilot improved spatial positioning, visibility, and object identification abilities. Some features will be temporarily limited or inactive during the transition period but will be restored in the near future via software updates.
- Planning (Determining the optimal path for the vehicle to take)
- Occupancy Network (Converting video from 8 cameras into vector space)
- Training Infrastructure (14,000 GPUs: 4,000 for auto labeling and 10,000 for training)
- Lanes and Objects (A system for modeling lane paths, especially at intersections)
- AI Compiler and Inference (Allow vehicles to make real-time decisions and adapt to new situations on the road)
- Auto Labeling (Computers automatically label clips)
- Simulation (Virtualizing real-world streets to simulate data that would be difficult or impossible to collect from the fleet)
- Data Engine (Tesla takes in data from the fleet/simulation and manually overrides challenge cases to improve the neural net)
In complex situations, there are many possible paths that the vehicle can take. If a pedestrian is crossing an intersection as the Tesla is making an unprotected left turn, for instance, the Tesla could aggressively go in front of the pedestrian. Or it could wait and go behind the pedestrian. It could also change its path to go around the pedestrian in either direction. These situations become more difficult as the number of other moving objects increases: there could also be cars crossing the intersection on both sides or traffic coming head-on.
If there are >20 relevant moving objects, the potential relevant interaction combinations quickly increase to, say, >100.
To solve this problem, the EV focuses its compute on the most promising outcomes using a parallelized tree search.
This works by selecting the most pertinent object first. It then determines the best path that would avoid that object, taking the object's trajectory into account. It then moves on to the second most important object and checks if the path also avoids that object. If it does, it moves to the third, and so on, until it finds a path that avoids all relevant objects. This allows the vehicle to move into a crowded street and avoid crashes.
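The object-by-object check described above can be sketched like this (an illustrative toy of ours, not the production planner; "relevance" is simplified to distance from the ego car, and paths are just waypoint lists):

```python
def path_clears(path, obstacle, margin=1.0):
    """True if every waypoint stays at least `margin` from the obstacle."""
    ox, oy = obstacle
    return all((x - ox) ** 2 + (y - oy) ** 2 >= margin ** 2 for x, y in path)

def pick_path(candidates, obstacles):
    # Most relevant obstacles (here: nearest to our start) are checked
    # first, so bad candidates are rejected as early as possible.
    ordered = sorted(obstacles, key=lambda o: o[0] ** 2 + o[1] ** 2)
    for path in candidates:
        if all(path_clears(path, o) for o in ordered):
            return path
    return None  # the real system would fall back to slowing or stopping

straight = [(float(i), 0.0) for i in range(6)]
swerve = [(0.0, 0.0), (1.0, 1.5), (2.0, 1.5), (3.0, 1.5), (4.0, 0.0), (5.0, 0.0)]
obstacles = [(2.0, 0.0), (8.0, 8.0)]   # a stopped car ahead, another far away
print(pick_path([straight, swerve], obstacles) is swerve)  # True
```

Because `all(...)` short-circuits, a candidate is discarded at the first obstacle it hits, which is the pruning that keeps the tree search cheap when many objects are relevant.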
Initially, Tesla used physics-based numerical optimization, but it wasn't fast enough, taking ~1-5 ms per action. So they developed lightweight queryable networks, trained on human drivers (data pulled from the Tesla fleet) and on offline solvers given relaxed time limits. This greatly reduced runtime, to ~100 microseconds per action.
To optimize further, they implemented collision checks, comfort analysis, intervention likelihood, and a human-like discriminator. These are checks to make sure that the system is working properly and not resulting in more collisions or decreased comfort.
The occupancy network is the core of Tesla Vision. The 8 cameras around the body of the vehicle are used to create a 3D vector space – a model of the world around the vehicle. Tesla feeds in 12-bit raw photos rather than processed 8-bit images because they carry more information: 16x greater dynamic range and reduced latency.
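The 16x figure follows directly from the bit depths, since each extra bit doubles the number of distinguishable intensity levels:

```python
# Each additional bit doubles the number of representable intensity levels.
levels_8 = 2 ** 8     # 256 levels per pixel in an 8-bit image
levels_12 = 2 ** 12   # 4096 levels per pixel in 12-bit raw
print(levels_12 // levels_8)  # 16
```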
The occupancy model predicts which objects (and even which part of an object) will move. In the presentation, there is a bus in front of the Tesla. The front of the bus turns blue in the vector space to indicate that the model predicts it will move first. The back of the bus remains red, indicating that it is not yet predicted to move.
As the bus keeps moving, the model predicts it will move and the entire bus turns blue.
The occupancy network was trained on 1.44 billion frames and required 100,000 GPU hours (with the GPUs running at around 90°C). A single GPU would need 100,000 hours, so Tesla runs many GPUs in parallel to reduce the wall-clock training time.
Tesla is using 14,000 GPUs: 4,000 for auto labeling and 10,000 for training.
Lanes and Objects
Tesla autopilot used to use a simple model that worked for straightforward lanes like freeways. To create FSD, the system needs to work for difficult situations like unprotected left turns at complex intersections.
The goal is to be able to route out every lane at an intersection and match it to the corresponding lane on the other side.
Elluswamy notes that this system isn't only for vehicles, it could also be used for Optimus, the humanoid robot that Tesla is developing. Tesla is building a system for modeling pathways.
AI Compiler and Inference
AI compilers play a crucial role in the deployment of machine learning models in Tesla's self-driving cars. These software tools optimize trained models for use on the car's onboard computers, allowing the vehicles to make real-time decisions without the need for a connection to a remote server.
Inference is also an important part of Tesla's self-driving system. When a Tesla car encounters a new object on the road, it uses its machine learning models to predict the likelihood that the object is a pedestrian, a stop sign, or something else. These predictions are then used by the car's decision-making system to navigate the road safely.
By using AI compilers and machine learning for inference, Tesla's self-driving cars are able to constantly learn and adapt to new situations, making them some of the most advanced autonomous vehicles on the road today.
It's difficult to turn the vast amount of data collected by the fleet into a form that can be used to train the neural net.
The solution is auto labeling. Labeling 10,000 trips took the computers just 12 hours, versus the estimated 5 million hours of manual labeling that would otherwise have been required.
Auto labeling has 3 core parts: high-precision trajectory, multi-trip reconstruction, and auto-labeling new trips.
High precision trajectory means that the vehicles in the fleet accurately understand their relationship to their surroundings and can send that data to Tesla.
Multi-trip reconstruction takes the data from one vehicle and syncs it with the data from another vehicle to make a virtual map of the streets where the fleet has traveled. At the end of the data collection, a human analyst finalizes the label.
Because the system already has labeling data for ground it has reconstructed, new trips over the same ground can be labeled automatically.
Tesla uses simulations for situations that are difficult to source or hard to label. The problem is that simulations are time-consuming to produce.
The simulated scene below typically takes 2 weeks to develop, but they now use automated ground truth labels and other tooling to develop similar scenes in 5 minutes.
The ground truth labels come from the auto labels in the previous section.
They can easily change the weather conditions, randomize foliage and obstructions, and change the landscape to either urban, suburban, or rural.
They use Unreal Engine for their simulations.
A single Tesla employee was able to simulate most of San Francisco in 2 weeks.
How Tesla improves its neural networks with data.
The model will sometimes make an incorrect prediction, for instance classifying a vehicle as waiting for traffic when it is actually awkwardly parked. To fix this, a Tesla employee manually overrides the label to show the neural net that its prediction was wrong. Over time, the neural net improves and predicts such situations more accurately.
Tesla sources the fleet data for examples similar to the one above and corrects the label. They fixed 13,900 clips at the time of the presentation. The clips are selected based on how poorly the model performs. In other words, they only look for situations where the model performs poorly, in order to improve its weaknesses.
The architecture doesn't even need to be updated. Simply overriding the labels will teach the neural net how to improve. Manually overridden clips are the evaluation set. As the training and evaluation sets grow, the accuracy of the vehicle's movement improves.
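The selection rule behind the data engine can be sketched in a few lines (our own minimal illustration; the clip fields and loss numbers are invented): rank fleet clips by how badly the current model scores on them and send only the worst to human review, so labeling effort is concentrated on the model's weaknesses.

```python
clips = [
    {"id": "clip_a", "model_loss": 0.02},
    {"id": "clip_b", "model_loss": 0.91},  # e.g. an awkwardly parked car
    {"id": "clip_c", "model_loss": 0.15},
    {"id": "clip_d", "model_loss": 0.77},  # e.g. a bus blocking a turn
]

def worst_clips(clips, budget=2):
    """Pick the `budget` clips the model handles worst for label review."""
    return sorted(clips, key=lambda c: c["model_loss"], reverse=True)[:budget]

print([c["id"] for c in worst_clips(clips)])  # ['clip_b', 'clip_d']
```

The corrected labels then join the training and evaluation sets, and the cycle repeats without any change to the architecture.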
Some of the key challenge cases are parked vehicles at turns, buses and large trucks, leading stopped vehicles, vehicles on curvy roads, and vehicles in parking lots.
AI researcher Lex Fridman projected Tesla's Autopilot miles in 2020: Tesla had over 3 billion real-world miles. In contrast, Waymo, Google's self-driving project, had logged only 20+ million real-world miles as of 2022.
Tesla's FSD neural networks predict the identity of objects, and in order to improve the accuracy of these predictions, multiple predictions must be made and then verified. To train the neural network, a person must check if the prediction made by the network is correct, and if not, the person must manually label the object correctly. For instance, if the network incorrectly predicts a stop sign as a slow sign, a manual labeler would have to correct this by assigning the appropriate label for the stop sign. However, manual labeling is costly and inefficient, as it requires many frames and objects to be labeled. To address this issue, Tesla developed its own supercomputer, named Dojo, as a solution.
What is Dojo?
Dojo is an advanced neural network training computer, which is designed to process large amounts of data and conduct extensive unsupervised training for neural networks. Essentially, Dojo acts as an automated "human labeler" that oversees the prediction process, significantly increasing the learning speed of neural networks. Currently, when Tesla collects data or visuals, Dojo pre-labels many objects in the frames, allowing human labelers to focus on correcting any errors made by Dojo rather than starting the labeling process from scratch. As a result, Dojo significantly reduces the cost of computation and labeling.
When a Tesla vehicle captures visual data, Dojo has the ability to transform one frame into a 3D video animation. This allows the neural networks to have a full sequence of data, including the ability to see forward and backward, instead of just one frame. This capability allows Dojo to enable self-supervised training for the neural networks. For example, the neural network can make a prediction in the first frame of the visual data, and then self-check its prediction in frame 20. This process can be repeated many times across all collected visuals, significantly increasing the speed of learning.
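The frame-1-versus-frame-20 self-check can be sketched as follows (a toy of our own construction; the positions, the 36 fps frame rate, and the constant-velocity model are all illustrative assumptions): forecast an object's state forward from frame 1, then score the forecast against what frame 20 actually shows, with no human label anywhere in the loop.

```python
def predict(pos, vel, frames, fps=36):
    """Constant-velocity forecast `frames` frames ahead (toy motion model)."""
    dt = frames / fps
    return (pos[0] + vel[0] * dt, pos[1] + vel[1] * dt)

frame1_pos, frame1_vel = (0.0, 0.0), (10.0, 0.0)  # metres, metres/second
frame20_obs = (5.3, 0.05)                          # what the camera saw later

pred = predict(frame1_pos, frame1_vel, 19)         # frame 1 -> frame 20
error = ((pred[0] - frame20_obs[0]) ** 2
         + (pred[1] - frame20_obs[1]) ** 2) ** 0.5
print(round(error, 2))  # this residual is the self-supervised training signal
```

Repeating this check across every tracked object in every collected clip yields a training signal at fleet scale without manual labeling.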
Object Location Understanding
Another advantage of Dojo is that it enables neural networks to understand the location of objects in relation to other objects. For instance, if there is a person walking on a street from frame 1 to 120, and in frame 121, the person is now obscured behind a parked car, without Dojo, the neural network would not be able to tell that the person is still present as they are obscured by the car. With Dojo, the neural network can infer that in frame 120 the person was beside the car and in frame 121, the person must be behind the car because they cannot disappear within one frame.
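The permanence rule can be illustrated with a tiny tracker (our own toy, with invented positions and a constant-velocity assumption): when a tracked pedestrian vanishes between frames, the track is coasted on its last known velocity rather than deleted, so the network can still reason that the person is behind the parked car.

```python
def coast_track(track, observed):
    """Return the track's position for this frame: the detection if one
    exists, otherwise the previous position advanced by the last velocity."""
    if observed is not None:
        return observed
    (px, py), (vx, vy) = track["pos"], track["vel"]
    return (px + vx, py + vy)  # occluded: coast instead of deleting the track

track = {"pos": (10.0, 2.0), "vel": (0.5, 0.0)}  # pedestrian walking along the kerb
print(coast_track(track, (10.4, 2.0)))  # visible frame: trust the detection
print(coast_track(track, None))         # occluded frame: coast to (10.5, 2.0)
```

Objects cannot disappear within a single frame, so coasting through short occlusions keeps the scene annotation consistent.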
Sources and Further Reading
- Tesla AI Day August 19, 2021
- Tesla AI Day September 30, 2022
- Deep Learning by Ian Goodfellow, et al.
- Scaled ML Karpathy presentation: https://youtu.be/hx7BXih7zx8
- FSD beta version history: https://www.findmyelectric.com/blog/tesla-fsd-beta-full-self-driving-explained/
- Updated version history: https://teslamotorsclub.com/tmc/threads/fsd-beta-release-history.261326/
- Tesla Vision-only after Oct 2022: https://www.tesla.com/support/transitioning-tesla-vision
- Model 3 owners manual (autopilot): https://www.tesla.com/ownersmanual/model3/en_jo/GUID-EDA77281-42DC-4618-98A9-CC62378E0EC2.html
- Read this for simulation news and AI day 2022 info: https://driveteslacanada.ca/news/ai-day-2022-fsd-simplified/
- Facebook RegNet research paper: https://arxiv.org/pdf/2003.13678.pdf
- Musk Tweet FSD: https://twitter.com/elonmusk/status/1609313412131037193?s=46&t=5j4eAYqI4DyklPzJjDoHrg