Calling Rust code from Python is made easy by PyO3. You can write a Rust library and rely on the combination of PyO3 and maturin
, a supporting tool from the PyO3
ecosystem, to compile the Rust library and have it installed directly as a Python module. Among others, PyO3 can translate types between Python and Rust as well as make it easy to export Rust functions to Python through a set of macros.
In this blog post, I will give a short introduction to PyO3. After that, I will discuss several example functions, written in Rust and called from Python. These examples include:
- calculate the n-th Fibonacci in Python as well as in Rust
- having Python use a variety of types in Rust functions
- using a Rust struct in Python code
- using Python to send JSON to Rust and serialize that JSON as a struct
- allow Rust to use the logger from the Python runtime
- generating an Error in Rust and catching it as an exception in Python
Introduction to PyO3
PyO3 offers some ergonomics for people wanting to glue Rust and Python code together. It helps you calling Python code from Rust as well as calling Rust code from Python. Since I have been using it only to call Rust code from Python, that is the only thing I will write about here.
So what does PyO3 give you?
For starters, there is maturin. This tool will compile the Rust code for you and install the compiled code as a Python module in your virtual environment. After this, you can import this module in your Python code and use it. After you pip install maturin, you only have to run 1 command (maturin develop) to use the Rust code in Python.
Apart from maturin
, there is of course PyO3 itself. PyO3 offers Rust bindings to the Python interpreter. This makes it so that you do not really have to bother that much with the interaction between Python and Rust. For instance, you will not have to worry about how to translate a Python string to something in C, and then something else again in Rust. The same goes for integers, floats, lists, dictionaries, etc. And to make things convenient, PyO3 comes with a lot of macros that prevent you from having to write too much boilerplate code. To expose Rust functions to Python, you annotate them with a macro. After this, PyO3 will take care of the rest. The same applies in case you want to export a struct or methods.
Calling a Rust function from Python
In this first example, we’ll call a Rust multiplication function from Python. Normally, we could write something like this:
fn multiply(a: isize, b: isize) -> isize {
a * b
}
Without adding too many things, we can make this function callable from Python. For starters, we need to:
- bring in the
pyo3
prelude - annotate the function with
#[pyfunction]
to turn it into a PyCFunction - wrap the result in a
PyResult
- add the function to the
#[pymodule]
The following code follows illustrates the above steps in the same order:
use pyo3::prelude::*;
#[pyfunction]
fn multiply(a: isize, b: isize) -> PyResult<isize> {
Ok(a * b)
}
#[pymodule]
fn rust(_py: Python, m: &PyModule) -> PyResult<()> {
m.add_function(wrap_pyfunction!(multiply, m)?)?;
Ok(())
}
The above will expose the multiply function in a Python module called rust
(after the name of the last function). We also need to put in place the proper Cargo.toml file.
To make things easy, make sure that the name of the library in Cargo.toml matches the name of the function that was annotated with #[pymodule]
. In my example, I put the following in my Cargo.toml
:
[lib] name = "rust"
When these two names match, the maturin
build tool will install the Rust library as a Python module using that name.
So, in this case, having chosen rust
as the name of the package, we can write the following Python to call the multiply function:
import rust
result = rust.multiply(2, 3)
print(result)
To be able to run this code, we need to compile the Rust code and install it as a Python library. This is where maturin
comes in:
root@rust:/# git clone https://github.com/saidvandeklundert/pyo3.git Cloning into 'pyo3'... remote: Enumerating objects: 36, done. ... Resolving deltas: 100% (9/9), done. root@rust:/# cd pyo3/multiply/ root@rust:/pyo3/multiply# python3 -m venv .env root@rust:/pyo3/multiply# source .env/bin/activate (.env) root@rust:/pyo3/multiply# pip install maturin Collecting maturin ... Installing collected packages: toml, maturin Successfully installed maturin-0.11.5 toml-0.10.2 (.env) root@rust:/pyo3/multiply# maturin develop 🔗 Found pyo3 bindings 🐍 Found CPython 3.9 at python Compiling proc-macro2 v1.0.32 ... Compiling multiply v0.1.0 (/pyo3/multiply) Finished dev [unoptimized + debuginfo] target(s) in 23.48s (.env) root@rust:/pyo3/multiply# python3 multiply.py 6
That was it!
Now I can run this script:
(.env) root@rust: multiply# python3 multiply.py 6
Calculating the n-th Fibonacci number in Python and in Rust
To calculate the n-th Fibonacci number, I will use a Python and Rust function that are very similar to each other. The following is the Python function:
def get_fibonacci(number: int) -> int:
"""Get the nth Fibonacci number."""
if number == 1:
return 1
elif number == 2:
return 2
total = 0
last = 0
current = 1
for _ in range(1, number):
total = last + current
last = current
current = total
return total
To add the Rust equivalent, we need to do the following:
- create the function in our
lib.rs
- annotate the function with
#[pyfunction]
- add the function to the module
This is illustrated in the following code snippet:
#[pyfunction]
fn get_fibonacci(number: isize) -> PyResult<u128> {
if number == 1 {
return Ok(1);
} else if number == 2 {
return Ok(2);
}
let mut sum = 0;
let mut last = 0;
let mut curr = 1;
for _ in 1..number {
sum = last + curr;
last = curr;
curr = sum;
}
Ok(sum)
}
#[pymodule]
fn rust(_py: Python, m: &PyModule) -> PyResult<()> {
m.add_function(wrap_pyfunction!(get_fibonacci, m)?)?;
Ok(())
}
After pulling in the new code, we can use maturin to build and install the module and do a comparison:
(.env) root@rust:/pyo3/pyo3# maturin develop (.env) root@rust:/pyo3/pyo3# python -i Python 3.9.2 (default, Feb 28 2021, 17:03:44) [GCC 10.2.1 20210110] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import timeit >>> >>> timeit.timeit("get_fibonacci(5)", setup="from fib import get_fibonacci") 0.49461510000401177 >>> >>> timeit.timeit("get_fibonacci(5)", setup="from rust import get_fibonacci") 1.1281064000068 >>> >>> >>> timeit.timeit("get_fibonacci(150)", setup="from fib import get_fibonacci") 9.057604000001447 >>> timeit.timeit("get_fibonacci(150)", setup="from rust import get_fibonacci") 3.5204217999998946
The above tells us that when we call the function to calculate the 5-th Fibonacci number, Python is faster. But when we look for the 150th Fibonacci number, Rust is almost three times faster.
But it gets better.
We can also do a release build by adding --release
as an argument to maturin:
(.env) root@rust:/pyo3/pyo3# python -i Python 3.9.2 (default, Feb 28 2021, 17:03:44) [GCC 10.2.1 20210110] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import timeit >>> timeit.timeit("get_fibonacci(5)", setup="from fib import get_fibonacci") 0.4583319000012125 >>> timeit.timeit("get_fibonacci(5)", setup="from rust import get_fibonacci") 0.11867309999797726 >>> timeit.timeit("get_fibonacci(150)", setup="from fib import get_fibonacci") 8.990601400000742 >>> timeit.timeit("get_fibonacci(150)", setup="from rust import get_fibonacci") 0.15236040000309004 >>>
With the release build, the Rust function is a lot faster in both cases.
I have also compared a lot of other functions. Scanning a text for a substring, manipulating texts, summing up numbers and a variety of other things. I have found that this type of performance increase is not that typical. Getting things to speed up 2x or 3x is usually not that hard.
Working with different types
When calling Rust function, PyO3 will convert the Python type you pass as function arguments to Rust types for you. It also converts the Rust types that Rust functions return to types that are usable in Python. The PyO3 user guide describes the way the mapping between the Python types and the Rust types is done. Having PyO3 do this automatically for you makes it easy to work with function arguments of different types. Next up are a few examples to illustrate this.
returning the sum of the numbers in a list:
The following function will sum the numbers in a vector and return the result:
#[pyfunction]
fn list_sum(a: Vec<isize>) -> PyResult<isize> {
let mut sum: isize = 0;
for i in a {
sum += i;
}
Ok(sum)
}
#[pymodule]
fn rust(_py: Python, m: &PyModule) -> PyResult<()> {
m.add_function(wrap_pyfunction!(list_sum, m)?)?;
Ok(())
}
When we call this function in Python, we can pass in a list of integers and we get a Python integer in return:
>>> rust.list_sum([10, 10, 10, 10, 10])
50
Let’s do one last performance comparison. In this case, we can see that it only makes sense to try and speed things up if the function has to perform a lot of computations. The following example shows the difference in performance with a small input to the function:
>>> timeit.timeit("rust.list_sum(a_list)", setup="""
... import rust
... a_list = [x for x in range(1,10)]
... """)
0.42956949999643257
>>>
>>> timeit.timeit("sum_list(a_list)", setup="""
... from __main__ import sum_list
... a_list = [x for x in range(1,10)]
... """)
0.4579178999993019
Hardly any difference. Now, when we increase the function input to 3.000 numbers, we can start seeing some real advantage over using the Rust function:
>>> timeit.timeit("sum_list(a_list)", setup="""
... from __main__ import sum_list
... a_list = [x for x in range(1,3000)]
... """)
168.12326449999819
>>> timeit.timeit("rust.list_sum(a_list)", setup="""
... import rust
... a_list = [x for x in range(1,3000)]
... """)
95.2027356000035
>>>
Key takeaway is that how much you’ll be able to speed things up really depends on what part of the code you outsource from Python to Rust. I’ll skip any further comparisons between Rust and Python and focus on a few more scenario’s that I think are worthwhile.
printing the values of a dict:
The next function will print the key and the values of a HashMap:
use std::collections::HashMap;
#[pyfunction]
fn dict_printer(hm: HashMap<String, String>) {
for (key, value) in hm {
println!("{} {}", key, value)
}
}
#[pymodule]
fn rust(_py: Python, m: &PyModule) -> PyResult<()> {
m.add_function(wrap_pyfunction!(dict_printer, m)?)?;
Ok(())
}
We can now call this function from Python, passing in a dictionary that is working with the same types:
>>> a_dict = {
... "key 1": "value 1",
... "key 2": "value 2",
... "key 3": "value 3",
... "key 4": "value 4",
... }
>>>
>>> rust.dict_printer(a_dict)
key 2 value 2
key 1 value 1
key 3 value 3
key 4 value 4
Note that the HashMap in Rust is different from a Python dictionary. Since we defined the Rust HashMap to use Strings for both key as well as value, we have to do the same thing in our Python code. If we use any other type as key or value, the function call will fail. Rust will try and coerce a type into another:
>>> try:
... rust.dict_printer({"a": 1, "b": 2})
... except TypeError as e:
... print(f"Caught a type error: {e}")
...
Caught a type error: argument 'hm': 'int' object cannot be converted to 'PyString'
>>>
printing a word n times
The following function will print a word several times. Additionally, it can also print the word in reverse and/or in uppercase:
#[pyfunction]
fn word_printer(mut word: String, n: isize, reverse: bool, uppercase: bool) {
if reverse {
let mut reversed_word = String::new();
for c in word.chars().rev() {
reversed_word.push(c);
}
word = reversed_word;
}
if uppercase {
word = word.to_uppercase();
}
for _ in 0..n {
println!("{}", word);
}
}
The example is a bit lame, but it shows that writing functions that take in a variety of types as arguments is not too difficult:
>>> rust.word_printer("hello", 3, False, True)
HELLO
HELLO
HELLO
>>> rust.word_printer("eyb", 2, True, False)
bye
bye
Using a Rust struct in Python
Much to my surprise, PyO3 also makes it increadibly easy to use a Rust struct in Python. Though I have not exhausted or tested all possibilities and corner-cases, using a struct with several methods is also pretty straightforward. I made the following example:
#[pyclass]
pub struct RustStruct {
#[pyo3(get, set)]
pub data: String,
#[pyo3(get, set)]
pub vector: Vec<u8>,
}
#[pymethods]
impl RustStruct {
#[new]
pub fn new(data: String, vector: Vec<u8>) -> RustStruct {
RustStruct { data, vector }
}
pub fn printer(&self) {
println!("{}", self.data);
for i in &self.vector {
println!("{}", i);
}
}
pub fn extend_vector(&mut self, extension: Vec<u8>) {
println!("{}", self.data);
for i in extension {
self.vector.push(i);
}
}
}
#[pymodule]
fn rust(_py: Python, m: &PyModule) -> PyResult<()> {
m.add_class::<RustStruct>()?;
Ok(())
}
Notice the use of 4 new annotations:
#[pyclass]
: above the struct definition, used to expose the class in Python.#[pymethods]
: above theimpl
block, used to expose the struct methods in Python as class methods.#[pyo3(get, set)]
: use these macros in case you want to be able to get or set the struct fields in Python.#[new]
: above the constructor, this is to be able to contstruct the struct as a class in Python.
Additionally, we add the struct to the module in a slightly different way. Instead if using the following:
m.add_function(wrap_pyfunction!(xxx, m)?)?;
We now used the following:
m.add_class::<RustStruct>()?; // inserted the name of the struct that is to be exported here
After running maturin develop
again, we are ready to use the struct in our Python. We can use the struct as though it is a Python class:
>>> rust_struct = rust.RustStruct(data="some data", vector=[255, 255, 255])
>>> rust_struct.extend_vector([1, 1, 1, 1])
Extending the vector.
>>> rust_struct.printer()
some data
255
255
255
1
1
1
1
>>> type(rust_struct)
<class 'builtins.RustStruct'>
>>> rust_struct.data
'some data'
>>> rust_struct.data = "some other data"
>>> rust_struct.data
'some other data'
Sending Json over to Rust
One way to send over complex data structures could be by using JSON. The following example marshals a JSON-string into a Rust struct:
extern crate serde;
extern crate serde_json;
use serde::{Deserialize, Serialize};
#[pyfunction]
fn human_says_hi(human_data: String) {
println!("{}", human_data);
let human: Human = serde_json::from_str(&human_data).unwrap();
println!(
"Now we can work with the struct:\n {:#?}.\n {} is {} years old.",
human, human.name, human.age,
)
}
#[derive(Debug, Serialize, Deserialize)]
struct Human {
name: String,
age: u8,
}
On the Python side, I will use Pydantic to send over some Json:
>>> from pydantic import BaseModel
>>>
>>> class Human(BaseModel):
... name: str
... age: int
...
>>>
>>> jan = Human(name="Jan", age=6)
>>> print(jan.json())
{"name": "Jan", "age": 6}
>>> rust.human_says_hi(jan.json())
{"name": "Jan", "age": 6}
Now we can work with the struct:
Human {
name: "Jan",
age: 6,
}.
Jan is 6 years old.
This might be abn approach to map a large struct to a datastructure that is similar on the Python side. I used pydantic
because it is one of my favourite Python packages. The .json()
method I used here convert the class to a JSON string.
Have Rust use a logger from the Python runtime
Rust can be made to log using the logger we define in Python. This is possible using pyo3-log
. We start off by adding the following to the Cargo.toml`:
[dependencies] pyo3-log = "0.5.0" log = "0.4.14"
Next, we create 2 example functions that will log a message:
use log::{debug, error, info, warn};
use pyo3_log;
#[pyfunction]
fn log_different_levels() {
error!("logging an error");
warn!("logging a warning");
info!("logging an info message");
debug!("logging a debug message");
}
#[pyfunction]
fn log_example() {
info!("A log message from {}!", "Rust");
}
#[pymodule]
fn rust(_py: Python, m: &PyModule) -> PyResult<()> {
pyo3_log::init();
m.add_wrapped(wrap_pyfunction!(log_example))?;
m.add_wrapped(wrap_pyfunction!(log_different_levels))?;
Ok(())
}
Then, after we define a logger in Python, we can use it in Rust:
>>> import logging
>>> FORMAT = "%(levelname)s %(name)s %(asctime)-15s %(filename)s:%(lineno)d %(message)s"
>>> logging.basicConfig(format=FORMAT)
>>> logging.getLogger().setLevel(logging.DEBUG)
>>> logging.info("Logging from the Python code")
INFO root 2021-11-15 20:24:46,979 <stdin>:1 Logging from the Python code>>> rust.log_example()
INFO rust 2021-11-15 20:24:51,336 lib.rs:118 A log message from Rust!
>>> rust.log_different_levels()
ERROR rust 2021-11-15 20:24:55,212 lib.rs:110 logging an error
WARNING rust 2021-11-15 20:24:55,212 lib.rs:111 logging a warning
INFO rust 2021-11-15 20:24:55,213 lib.rs:112 logging an info message
DEBUG rust 2021-11-15 20:24:55,213 lib.rs:113 logging a debug message
>>>
Catch an exception
In this last example, we will raise an error in Rust and catch that as an exception in Python. This part is quite involved, the steps I took are shown after the code snippet:
use std::fmt;
// 1
#[derive(Debug)]
struct MyError {
pub msg: &'static str,
}
// 2
impl std::error::Error for MyError {}
// 3
impl fmt::Display for MyError {
fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
write!(f, "Error from Rust: {}", self.msg)
}
}
// 4
impl std::convert::From<MyError> for PyErr {
fn from(err: MyError) -> PyErr {
PyOSError::new_err(err.to_string())
}
}
#[pyfunction]
// 5
fn greater_than_2(number: isize) -> Result<isize, MyError> {
if number <= 2 {
return Err(MyError {
msg: "number is less than or equal to 2",
});
} else {
return Ok(number);
}
}
#[pymodule]
fn rust(_py: Python, m: &PyModule) -> PyResult<()> {
// 6
m.add_function(wrap_pyfunction!(greater_than_2, m)?)?;
Ok(())
}
1: We define ‘MyError’ as a custom Error. The struct has a field that is used to be able to send a custom message. 2: The ‘Error’ trait is implemented for ‘MyError’. 3: We implement the ‘Display’ trait for ‘MyError’ and have it display the message from the ‘msg’ field in the ‘MyError’ struct. 4: The ‘From’ trait is implemented for ‘MyError’. This trait is used to do value-to-value conversions while consuming the input value. 5: We create a function called ‘greater_than_2’. This function will raise the error/exception in case the input value is 2 or less. 6: We add the function to the Python module in the ‘regular’ way.
Now we move to the Python side and run the function, triggering the exception:
>>> rust.greater_than_2(1)
Traceback (most recent call last):
...
OSError: Error from Rust: number is less than or equal to 2
>>> rust.greater_than_2(3)
3
>>> rust.greater_than_2(11)
11
>>> rust.greater_than_2(-11)
Traceback (most recent call last):
...
OSError: Error from Rust: number is less than or equal to 2
It took quite a bit of code to put that in! Still not too hard in my opinion.
Closing thoughts.
Working with PyO3 and using it to call Rust from Python is a very pleasant experience. I find that the ergonomics that have been put in place make everything a lot easier when compared to relying on ffi
or libc
. The macros offer a lot of convenience and having PyO3 translate the types between Rust and Python brings a lot of ease to the development process. And then there is the build tool maturin
. This is an absolute joy to work with.
The repo is actively maintained, and the documentation is great. After learning about PyO3, I am excited about using Rust in my Python.
The code samples from this blog can be found here. There is also a Docker build file there in case you want to run the code yourself.
Some other worthwhile links in the context of PyO3: