Performance Part I

Getting started thinking about performance

Jul 18, 2024

If you’re a seasoned developer or even just tinkering with new languages, speed is always a measuring stick. And Rust, the language that marries performance with safety, often piques the curiosity of those looking for raw speed. But how can you empirically confirm Rust’s speed? Let’s dive into how you can speed-test Rust code effectively.

The Proposition

Rust promises performance akin to C/C++ but with the added benefit of memory safety. For every aspiring system or application developer, the prospect of squeezing out every last cycle from the CPU is enticing. The joy of Rust comes from its strict compiler, which forces you to write optimized, error-free code, thus reducing runtime surprises.

Consider the simple task of summing an array of integers. In Rust, you'd

typically start with something like this:

fn sum_array(arr: &[i32]) -> i32 {

let mut sum = 0;

for &num in arr.iter() {

sum += num;

}

sum

}

fn main() {

let arr = vec![1, 2, 3, 4, 5];

println!("Sum: {}", sum_array(&arr));

}

This is straightforward, but for serious performance testing, you'd want to track how long this operation takes. Rust’s standard library makes this surprisingly easy thanks to the `Instant` type.

Measurement: The Setup

To measure time, we'll use Rust’s `std::time::Instant`. Start by recording the time just before the operation, and then again immediately after it completes. Here's an updated example:

use std::time::Instant;

fn sum_array(arr: &[i32]) -> i32 {

let mut sum = 0;

for &num in arr.iter() {

sum += num;

}

sum

}

fn main() {

let arr = vec![1, 2, 3, 4, 5];

let start = Instant::now();

let total = sum_array(&arr);

let duration = start.elapsed();

println!("Sum: {}", total);

println!("Time taken: {:?}", duration);

}

What’s compelling about Rust is that it pushes you towards efficiency. For instance, leveraging iterators and combinators can markedly improve the performance:

fn sum_array_iter(arr: &[i32]) -> i32 {

arr.iter().sum()

}

fn main() {

let arr = vec![1, 2, 3, 4, 5];

let start = Instant::now();

let total = sum_array_iter(&arr);

let duration = start.elapsed();

println!("Sum: {}", total);

println!("Time taken with iterators: {:?}", duration);

}

Beyond the Basics: Profiling

For more granular insights, consider using profiling tools. `perf` is a robust tool on Linux systems that can profile Rust applications. First, compile your Rust code with debugging information:

cargo build --release

Then profile it using `perf`:

perf record ./target/release/your_application

perf report

This delivers an expansive view of where your program spends its time down to individual lines of code.

Benchmarks: Fine-Tuning with Criterion

Benchmarking can refine our understanding of performance. Criterion.rs offers statistical rigor and ease of use. Start by adding `criterion` to your `Cargo.toml`:

[dev-dependencies]

criterion = "0.3"

Then set up a benchmark:

extern crate criterion;

use criterion::{black_box, Criterion, criterion_group, criterion_main};

fn sum_array(arr: &[i32]) -> i32 {

arr.iter().copied().sum()

}

fn criterion_benchmark(c: &mut Criterion) {

c.bench_function("sum_array", |b| {

let arr = (0..1000).collect::<Vec<_>>();

b.iter(|| sum_array(black_box(&arr)));

});

}

criterion_group!(benches, criterion_benchmark);

criterion_main!(benches);

Running your benchmarks with `cargo bench` gives you detailed results. Criterion even allows for comparing multiple versions of a function or different algorithm implementations.

Real-World Scenario

In real-world applications, Rust’s speed shines. Let’s consider a more complex task like file I/O operations combined with string processing. Here’s a snippet that reads a file and counts the frequency of each word:

use std::collections::HashMap;

use std::fs::File;

use std::io::{BufRead, BufReader, Result};

fn word_count(path: &str) -> Result<HashMap<String, usize>> {

let file = File::open(path)?;

let reader = BufReader::new(file);

let mut counts = HashMap::new();

for line in reader.lines() {

for word in line?.split_whitespace() {

*counts.entry(word.to_string()).or_insert(0) += 1;

}

Ok(counts)

}

fn main() {

let start = Instant::now();

match word_count("sample.txt") {

Ok(counts) => {

for (word, count) in counts {

println!("{}: {}", word, count);

}

Err(e) => eprintln!("Error: {:?}", e),

}

let duration = start.elapsed();

println!("Time taken: {:?}", duration);

}

This example reads a file line by line and updates a hash map with word counts. Even for I/O-bound operations, Rust offers managed concurrency to gain performance boosts further.

For tasks like this, you could combine Rayon for parallel data processing:

extern crate rayon;

use rayon::prelude::*;

use std::collections::HashMap;

fn word_count_parallel(texts: Vec<String>) -> HashMap<String, usize> {

texts.par_iter()

.flat_map(|text| text.split_whitespace())

.fold(HashMap::new, |mut acc, word| {

*acc.entry(word.to_string()).or_insert(0) += 1;

acc

})

.reduce(HashMap::new, |mut acc, map| {

for (word, count) in map {

*acc.entry(word).or_insert(0) += count;

}

acc

})

}

fn main() {

let texts = vec![

"This is a test".to_string(),

"Another test".to_string(),

"Yet another simple test".to_string(),

];

let start = Instant::now();

let counts = word_count_parallel(texts);

let duration = start.elapsed();

for (word, count) in counts {

println!("{}: {}", word, count);

}

println!("Time taken with parallelism: {:?}", duration);

}

Rust doesn't just tell you to write fast code. It nudges you towards best practices through its strict compiler checks and excellent tooling. Speed testing Rust isn’t just about timing your code; it’s about understanding and leveraging the language's paradigms. The tools available, from `std::time::Instant` to Criterion and Rayon, ensure you can both measure and achieve exceptional performance. Whether you’re summing arrays or processing large datasets, Rust provides the efficiency you need without sacrificing safety.

Rusting

Discussion about this post

Ready for more?