In feburary 2017, CPython from Bitbucket with Mercurial moved to GitHub with Git: read [Python-Dev] CPython is now on GitHub by Brett Cannon.
In 2016, I worked on speed.python.org to automate running benchmarks and make benchmarks more stable. At the end, I had a single command to:
- tune the system for benchmarks
- compile CPython using LTO+PGO
- install CPython
- install performance
- run performance
- upload results
But my tools were written for Mercurial and speed.python.org uses Mercurial revisions as keys for changes. Since the CPython repository was converted to Git, I have to remove all old results and run again old benchmarks. But before removing everyhing, I took screenshots of the most interesting pages. It would prefer to keep a copy of all data, but it would require to write new tools and I am not motivated to do that.
Python 3.7 compared to Python 2.7
Benchmarks where Python 3.7 is faster than Python 2.7:
data:image/s3,"s3://crabby-images/6dbc3/6dbc364b10d4d5982b9161b4e7df7ae6a7795f5f" alt="python37_faster_py27"
Benchmarks where Python 3.7 is slower than Python 2.7:
data:image/s3,"s3://crabby-images/6dcd8/6dcd80b2a2c7ac1ee2a202e2fa017c25bbefb392" alt="python37_slower_py27"
Significant optimizations
CPython became regulary faster in 2016 on the following benchmarks.
call_method, the main optimized was Speedup method calls 1.2x:
data:image/s3,"s3://crabby-images/5e5cd/5e5cdf6f3476383275b23033d20660d10200752d" alt="call_method"
float:
data:image/s3,"s3://crabby-images/53de8/53de86cd3842f048200433e4917927f9be5050e9" alt="float"
hexiom:
data:image/s3,"s3://crabby-images/c6351/c635169be68f085926d250c36e69ea9eede69e72" alt="hexiom"
nqueens:
data:image/s3,"s3://crabby-images/5b3c0/5b3c07b47f88f083476a6d26bf4f4417d192d05d" alt="nqueens"
pickle_list, something happened near September 2016:
data:image/s3,"s3://crabby-images/4c746/4c746e7f7533ee52f5e5bc6dcea58eb18741a608" alt="pickle_list"
richards:
data:image/s3,"s3://crabby-images/e17fd/e17fd1df79bade1b55773b26906084254f2f1ac8" alt="richards"
scimark_lu, I like the latest dot!
data:image/s3,"s3://crabby-images/0386f/0386f911e5623a6e6bc8b4721ef870beac436626" alt="scimark_lu"
scimark_sor:
data:image/s3,"s3://crabby-images/bf666/bf66621f9a46725ab8957580576bd667ba201d43" alt="scimark_sor"
sympy_sum:
data:image/s3,"s3://crabby-images/2420e/2420ed7ebbd243ca4e57589a73413a02a349ffaf" alt="sympy_sum"
telco is one of the most impressive, it became regulary faster:
data:image/s3,"s3://crabby-images/d241b/d241b895cf8f0bbb1fc56edc10ebaaed81a182f3" alt="telco"
unpickle_list, something happened between March and May 2016:
data:image/s3,"s3://crabby-images/83d56/83d5655e71760222bed878b214a79bc1f7227900" alt="unpickle_list"
The enum change
One change related to the enum module had significant impact on the two following benchmarks.
python_startup:
data:image/s3,"s3://crabby-images/a744a/a744abee93dbea2536cbcbbc4f3e1e498224d32a" alt="python_startup"
See "Python startup performance regression" section of My contributions to CPython during 2016 Q4 for the explanation on changes around September 2016.
regex_compile became 1.2x slower (312 ms => 376 ms: +20%) because constants of the re module became enum objects: see convert re flags to (much friendlier) IntFlag constants (issue #28082).
data:image/s3,"s3://crabby-images/c80c1/c80c1d524d26f912fac7f3cb3122a0a67802f7ba" alt="regex_compile"
Benchmarks became stable
The following benchmarks are microbenchmarks which are impacted by many external factors. It's hard to get stable results. I'm happy to see that results are stable. I would say very stable compared to results when I started to work on the project!
call_simple:
data:image/s3,"s3://crabby-images/5019d/5019d72bc1a97e5600ed682ad4fec3a25f155d4f" alt="call_simple"
spectral_norm:
data:image/s3,"s3://crabby-images/90343/90343ad33180fe2f6f39eaba9143332305e1b7f9" alt="spectral_norm"
Straight line
It seems like no optimization had a significant impact on the following benchmarks. You can also see that benchmarks became stable, so it's easier to detect performance regression or significant optimization.
dulwich_log:
data:image/s3,"s3://crabby-images/14ba0/14ba0f1820143dcf60fa5b070cd6ef2848208425" alt="dulwich_log"
pidigits:
data:image/s3,"s3://crabby-images/b3341/b334121cc85183aa81ce070836e22786ab0f3511" alt="pidigits"
sqlite_synth:
data:image/s3,"s3://crabby-images/726d9/726d9039b9b0c065156101616979cddccb8643e6" alt="sqlite_synth"
Apart something around April 2016, tornado_http result is stable:
data:image/s3,"s3://crabby-images/4d054/4d054a78d6a016fcda2d69426c7bcc04b5d3eba3" alt="tornado_http"
Unstable benchmarks
After months of efforts to make everything stable, some benchmarks are still unstable, even if temporary spikes are lower than before. See Analysis of a Python performance issue to see the size of previous tempoary performance spikes.
regex_v8:
data:image/s3,"s3://crabby-images/df0d1/df0d1d76efd331b7473132ae85b2d0875ed10852" alt="regex_v8"
scimark_sparse_mat_mult:
data:image/s3,"s3://crabby-images/48479/48479126cb2dbeb26427ec35eaa2a8087e350431" alt="scimark_sparse_mat_mult"
unpickle_pure_python:
data:image/s3,"s3://crabby-images/13978/1397894732694f6564ff37bacaca34c14879d48e" alt="unpickle_pure_python"
Boring results
There is nothing interesting to say on the following benchmark results.
2to3:
data:image/s3,"s3://crabby-images/7cc6f/7cc6fed5d2dbb06b898d9f4cd3bfa9098d6fd02c" alt="2to3"
crypto_pyaes:
data:image/s3,"s3://crabby-images/7bce0/7bce08b3d250aaa872e9300f338e910230c376a0" alt="crypto_pyaes"
deltablue:
data:image/s3,"s3://crabby-images/2433d/2433d4f4eb74a68ea39ad7688e7c1870ca707ad2" alt="deltablue"
logging_silent:
data:image/s3,"s3://crabby-images/bd54e/bd54e820799a5bf798ae43098d69918d25b517fb" alt="logging_silent"
mako:
data:image/s3,"s3://crabby-images/b5ca6/b5ca6810bf90a40202b3113f6450b4e8d281a46c" alt="mako"
xml_etree_process:
data:image/s3,"s3://crabby-images/f7a80/f7a802f9b56b729bcba61673b70948082934143b" alt="xml_etree_process"
xml_etre_iterparse:
data:image/s3,"s3://crabby-images/51eb5/51eb55cb692cbd14e31ec5e56e7d821ee43befa6" alt="xml_etre_iterparse"