Making it Easy with Rayon
A library named "Rayon" is the gold-standard for easy thread-based concurrency in Rust. It actually uses another crate (crossbeam
) under the hood, but it provides a much simpler interface for the most common use cases. Rayon can help you with a lot of tasks. Let's work through using it.
Parallel Iterators
This example is in the
rayon_par_iter
directory, in the02_threads
directory.
Let's start by adding Rayon to the project:
cargo add rayon
Probably the nicest addition Rayon bring is par_iter
. The majority of things you can do with an iterator, you can auto-parallelize with par_iter
. For example:
use rayon::prelude::*; fn main() { let numbers: Vec<u64> = (0 .. 1_000_000).collect(); let sum = numbers.par_iter().sum::<u64>(); println!("{sum}"); }
Rayon creates a thread-pool (1 thread per CPU), with a job queue. The queue implements work-stealing (no idle threads), and supports "sub-tasks" - a task can wait for another task to complete. It really is as simple as using par_iter()
(for an iterator of references), par_iter_mut()
(for an iterator of mutable references), or into_par_iter()
(for an iterator of values that moves the values).
Let's do another test, this time with nested tasks. We'll use a really inefficient function for finding prime numbers:
#![allow(unused)] fn main() { fn is_prime(n: u32) -> bool { (2 ..= n/2).into_par_iter().all(|i| n % i != 0 ) } }
And we can parallelize a search of a million numbers as follows:
#![allow(unused)] fn main() { // Print primes below 1,000 let now = Instant::now(); let numbers: Vec<u64> = (2 .. 1_000_000).collect(); let elapsed = now.elapsed(); let mut primes: Vec<&u64> = numbers.par_iter().filter(|&n| is_prime(*n as u32)).collect(); primes.sort(); println!("{primes:?}"); println!("It took {} ms to find {} primes", elapsed.as_millis(), primes.len()); }
The result on my PC is: It took 4 ms to find 78498 primes
. In other words, it took longer to sort and print the result than it did to calculate it.
Fortunately, Rayon can parallelize the sort, too:
#![allow(unused)] fn main() { primes.par_sort_unstable(); }
That knocked a few more milliseconds off the time.