Skip to content

Latest commit

 

History

History
324 lines (250 loc) · 16.3 KB

README.md

File metadata and controls

324 lines (250 loc) · 16.3 KB

TLS 1.3 protocol implementation in Rust

This is an educational, partially incomplete project to learn about the TLS 1.3 protocol and the general challenges of implementing binary-level network protocols.

The project attempts to follow the standard RFC8466 as closely as possible in naming conventions and data structures when converting the standard into Rust implementation.

The implementation is not likely the most efficient in terms of performance, or might not apply all the best cryptographic practices (e.g. constant times). It leaves cryptographic operations mostly for external libraries, and we just generate some cryptographically random bits.

In this project, we are more interested in the TLS protocol at the byte level and how to implement decoders, create functional tests for them and integrate fuzzing straight from the beginning.

Most of the data structures and their encoders have been provided. The logic to implement the complete handshake, error checking and decoding is left mostly for the student. Error checking is already at a good level, with some intentional flaws.

The project currently supports:

  • TLS_CHACHA20_POLY1305_SHA25 Cipher Suite
  • Key exchange with X25519 and signatures with EdDSA (Elliptic Curve Diffie-Hellman key exchange using Curve25519 and Edwards-Curve Digital Signature Algorithm based on the same curve)

Credits to Michael Driscoll for his excellent TLS 1.3 illustration.

Quickstart for Rust

We do not provide teaching for the language itself, but we can help on many issues. If you have programmed with any low-level language, you should be able to grasp the basics quickly, if the Rust is still unknown.

If you fight with the compiler, it is just preventing potential bugs from the runtime instead. You have to adopt the idea that data is always owned by default. Read the chapter in Rust's book about ownership.

This project attempts to avoid some complex features of Rust. You don't need to know lifetimes to complete the TLS handshake and decrypt application data content. It is okay to make additional copying on this project with the cost of performance and increased memory usage.

For general guidance, check

  • The Rust Book
  • You can use ChatGPT, Copilot, or other LLM, for example, to explain some complex lines of code. They are usually very accurate.

To get started with Rust environment, use Rustup.

It is also important to get your IDE environment properly set up. For a beginner, the most straight-forward ways are:

  • Use VSCode with Rust Analyzer.
    • Configure VSCode to run rustfmt on save.
    • After that, the editor should be able to note most problems in the code, and for example, to run single functional tests from the IDE.
    • You can also use Clippy to find issues in your code. You can also configure VSCode to run it automatically in real-time, but it will take some resources.
    • Additionally, if you want to try, you can also use a student license to run GitHub Copilot. But don't adapt code pieces you don't understand. It also gives access to using GitHub Copilot Chat with your code. After that, you can even highlight specific code and ask questions or suggestions from the Copilot. But then again, do not use things blindly. However, it is extremely useful for this project, since we write some repetitive code and do large pattern matching.
  • Use the student license for JetBrains RustRover.
    • You can apply the same as above also for RustRover.

Usage

The program takes a single argument as a parameter. You can use the RUST_LOG environment variable to change the logging level. (one of debug, info, warn, error)

To run with debug level and install dependencies, just

RUST_LOG=debug cargo run cloudflare.com:443

This will do a partial TLS 1.3 handshake with cloudflare.com

What works?

It is very helpful to look at the process flow here to understand what is the current status.

The project implements the necessary data structures and cryptographic primitives to create the initial ClientHello message and convert it into raw bytes with correct length determinants.

It also implements minimal data structures and decoders to parse the first TLS Record from the server response, which includes the ServerHello message. However, extensions are not parsed to correct structures. They are left on purpose to give a somewhat easy starting point.

As a result, the following output log can be seen:

RUST_BACKTRACE=1 RUST_LOG=info cargo run cloudflare.com:443
    Finished dev [unoptimized + debuginfo] target(s) in 0.02s
     Running `target/debug/tls13tutorial 'cloudflare.com:443'`
     [INFO  tls13tutorial] Successfully connected to the server 'cloudflare.com:443'.
[INFO  tls13tutorial] Sending ClientHello as follows...

ClientHello, Length=163
  client_version=0x303
  random (len=32): 000102030405060708090A0B0C0D0E0F101112131415161718191A1B1C1D1E1F
  session_id (len=32): 000102030405060708090A0B0C0D0E0F101112131415161718191A1B1C1D1E1F
  cipher_suites (len=1)
    [0x13, 0x03] TLS_CHACHA20_POLY1305_SHA256
  compression_methods (len=1): 00
  extensions (len=88)
    extension_type=supported_versions(43), length=3
      data: 020304
    extension_type=server_name(0), length=19
      data: 001100000E636C6F7564666C6172652E636F6D
    extension_type=supported_groups(10), length=4
      data: 0002001D
    extension_type=signature_algorithms(13), length=4
      data: 00020807
    extension_type=key_share(51), length=38
      data: 0024001D00208F40C5ADB68F25624AE5B214EA767A6EC94D829D3D7B5E1AD1BA6F3E2138285F

[INFO  tls13tutorial] The handshake request has been sent...
[INFO  tls13tutorial] Received 1448 bytes of data.
[INFO  tls13tutorial] Received 1724 bytes of data.
[INFO  tls13tutorial] Response TLS Record received!
[INFO  tls13tutorial] Response TLS Record received!
[INFO  tls13tutorial] Response TLS Record received!
[INFO  tls13tutorial] Response TLS Record received!
[INFO  tls13tutorial] ServerHello message: ServerHello { legacy_version: 771, random: [169, 217, 208, 239, 149, 24, 213, 231, 132, 210, 235, 68, 75, 141, 242, 92, 239, 74, 79, 118, 237, 108, 111, 93, 25, 95, 50, 61, 29, 2, 80, 14], legacy_session_id_echo: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31], cipher_suite: CipherSuite([19, 3]), legacy_compression_method: 0, extensions: [Extension { origin: Server, extension_type: KeyShare, extension_data: Unserialized([0, 29, 0, 32, 7, 76, 95, 86, 68, 123, 126, 243, 199, 19, 16, 66, 3, 141, 58, 133, 178, 8, 117, 33, 2, 209, 26, 85, 35, 103, 112, 213, 53, 126, 112, 17]) }, Extension { origin: Server, extension_type: SupportedVersions, extension_data: Unserialized([3, 4]) }] }
[WARN  tls13tutorial] TODO: Implement the server hello message processing, and decoding of the rest of the extensions
[ERROR tls13tutorial] Unexpected response type: ChangeCipherSpec
[INFO  tls13tutorial] Application data received, size of : 3010
[WARN  tls13tutorial] TODO: Decryption of the data and decoding of the all extensions not implemented
[INFO  tls13tutorial] Application data received, size of : 19
[WARN  tls13tutorial] TODO: Decryption of the data and decoding of the all extensions not implemented

Since the server gets all the required information from the initial ClientHello request, it can start sending encrypted ApplicationData, which contains encrypted extensions and certificates.

The first steps to continue with the project are:

  • Implement decoders for missing required extensions to fully construct the extension types. Currently, extensions types are identified but their inner data is left unparsed.
  • After that, you need to construct all the traffic keys for decrypting the rest of the handshake data. Check illustration.
    • Key Derivation functions are pre-implemented. You need to:
      • Parse the server's public key and add it to the HandshakeKeys structure
      • Calculate the transcript hash
      • Use the key_schedule function to calculate the keys and use the keys afterward.
      • Implement decrypting function for CHACHA20_POLY1305_SHA256. You can use this dependency (already in the project). Note the use of a sequence counter!
      • Now you can decrypt and continue finishing the handshake process until the server provides session tickets, and go as far as you like.

You can also see the documentation of this project in the browser. Run cargo doc --open. Documentation and Display formatting pull requests are welcome - and might even give some points.

Type-Length-Value (TLV) pattern

In (binary) network communication protocols, type-length-value or tag-length-value (TLV) is a common pattern, and TLS follows it in its many sub-protocols.

For example, the lowest layer in the TLS protocol uses the so-called Record Protocol.

It wraps the higher-level protocols, by following the idea of TLV. To see that in practice, we can take a look at the TLS Record ASN.1 definition and TLSRecord as an example of Rust data structure:

pub type ProtocolVersion = u16;

// Max value is 255 (1 byte to represent)
#[derive(Debug, Copy, Clone)]
pub enum ContentType {
    Invalid = 0,
    ChangeCipherSpec = 20,
    Alert = 21,
    Handshake = 22,
    ApplicationData = 23,
}

#[derive(Debug, Clone)]
pub struct TLSRecord {
    pub record_type: ContentType,
    // 2 bytes to represent
    pub legacy_record_version: ProtocolVersion,
    // length defined as 2 bytes. Maximum value is 2^14 or 2^14 + 256 depending on the `ContentType`
    pub length: u16,
    pub fragment: Vec<u8>, // fragment of size 'length'
}

In the context of binary encoding, everything must have a size. The data should be possible to deserialize to the original structure based on the byte stream only. If we want to serialize the structure of TLSRecord into bytes, we go in order every field and create the byte presentation, as the TLS standard defines.

  • A record field takes 1 byte to present as the max value for ContentType is defined to be 255 (u8). In this case, the type defines the data content of the fragment field. When you decode the byte stream, you know that the first byte is about ContentType and takes one byte.
  • ProtocolVersion has type u16 and that takes 2 bytes to present this number. On the lower level, we differ a bit from basic TLV as the version must be defined early.
  • The length of the fragment is presented with 2 bytes. Here it has a separate field, but that is not always the case and byte arrays with dynamic size should have always a length determinant.
  • fragment is an array of bytes with a total size length based on the length of the previous.

TLS Record protocol has the plaintext version and ciphertext version, depending on the content type and the stage when a record is being used.

Basics of implementing the decoders

The base project reads all the data from the TCP stream into the VecDeque buffer. It might not be the most efficient way to parse bytes, but it allows taking chunks or single bytes without the need to move or clone the remaining data in the memory. From a learning point of view, it can be better than parsing traditional vectors or byte slices.

It provides methods such as drain or pop_front to consume parts from the buffer in the correct order. This is useful, for example, if we want to implement a decoder for a specific type. We consume as many bytes as needed to construct the object, and the leftover data is still in the original buffer, waiting to construct the follow-up object.

You can try to parse raw byte slices, but be warned about bugs! VecDeque can also panic with incorrect index usage.

Note

There is already an abstraction for VecDeque to reduce the repetition of code in parser.rs. You are free to improve this further. There can be (intentional) bugs already. However, by using Option type, we already note the most of the possible errors what can appear when extracting data from buffer.

Functional testing (positive and negative)

Implementing functional tests in Rust can be done to the same file as implementation.

This can be greatly beneficial; it is very easy to implement testing for your structs and their function implementations without ever needing to run the functionality through the main function in the development phase.

To demonstrate testing, the alert protocol is the simplest sub-protocol in TLS. Take a look for alert.rs which implements the data structures and encoders and decoders. At the end of the file, there is a test module. If you have configured your IDE correctly, you should be able to click the play ▶ button to run the single test function, for example, test_alert_from_bytes, or all the tests from the module at once.

You can also do the same from the command line:

cargo test alert::tests::test_alert_from_bytes

Since the alert protocol uses only two bytes, it is straightforward to implement both positive tests and negative tests.

If you wonder what the impl std::fmt::Display for means in the Alert data structures, it implements a textual presentation for those objects, for example, what is the output format when the println!() macro is used for the data type.

Fuzzing the project

If you are using a Linux or macOS machine for development, starting fuzz testing with precise control is very straightforward. There are many fuzzing libraries, that can be integrated into the project to get coverage-guided fuzzing.

Fuzzing provides the stream of bytes for the interface you choose, and depending on the used backend, it will provide testing data and coverage-guided mutation to test the robustness of the selected functionality.

In the fuzzing directory, the cargo fuzz is pre-configured. As one fuzzing target, the TLS Alert protocol is used as an example.

See the project and its documentation.

You need to use a Linux or macOS environment with a nightly compiler for that.

To start fuzzing, simply run

cargo fuzz run fuzz_target_1

The project has a dual target - see src/lib.rs which can be used as an interface for fuzzing (you can import everything defined in there to fuzz target). Modify as needed. You can already directly import other data structures for fuzzing.

Debugging and comparing to OpenSSL

You can use Wireshark to check the first ClientHello and ServerHello messages. Data is mostly encrypted after these, but you can still see TLS Records with Application Data type afterward, while the inner content is encrypted.

You can also see real, decrypted packets by using OpenSSL.

openssl genpkey -algorithm X25519 -out debug_key.pem # Generate X25519 negotiation key

openssl s_client -connect cloudflare.com:443 -tls1_3 \
                 -keylogfile secrets.log \
                 -key debug_key.pem \
                 -msg -msgfile messages.log \
                 -state -ciphersuites TLS_CHACHA20_POLY1305_SHA256 \
                 -debug \
                 -trace -nocommands -tlsextdebug

You can see the handshake data in the messages.log file as unencrypted.

Traffic secrets are in the secrets.log file.