bpo-45045: Optimize mapping patterns of structural pattern matching #28043

corona10 · 2021-08-29T15:58:52Z

https://bugs.python.org/issue45045

corona10 · 2021-08-29T16:00:29Z


+---------------+--------+----------------------+
| Benchmark     | base   | opt                  |
+===============+========+======================+
| bench pattern | 482 ns | 417 ns: 1.15x faster |
+---------------+--------+----------------------+

corona10 · 2021-08-29T16:16:24Z

Python/ceval.c

@@ -859,7 +859,7 @@ match_keys(PyThreadState *tstate, PyObject *map, PyObject *keys)
    if (dummy == NULL) {
        goto fail;
    }
-    values = PyList_New(0);
+    values = PyTuple_New(nkeys);


The number of values is known up front (one per key), so a tuple of size nkeys can be allocated directly instead of growing a list.
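
For readers less familiar with ceval.c, here is a rough Python sketch of the lookup loop that match_keys performs, written under the assumption that the C function behaves as the diff context and comments in this thread suggest. It omits the C code's error handling and any duplicate-key checking, and it is not the actual implementation.

```python
from collections import defaultdict


def match_keys(mapping, keys):
    """Return a tuple of values for `keys`, or None if any key is missing."""
    get = mapping.get             # looked up once, outside the loop
    dummy = object()              # unique sentinel, cannot equal a stored value
    values = [None] * len(keys)   # size known up front, like PyTuple_New(nkeys)
    for i, key in enumerate(keys):
        # .get() with a default never triggers __missing__, so a
        # defaultdict is not mutated by a failed match.
        value = get(key, dummy)
        if value is dummy:
            return None
        values[i] = value
    return tuple(values)


dd = defaultdict(int, a=1, b=2)
print(match_keys(dd, ("a", "b")))
print(match_keys(dd, ("a", "z")))
print("z" in dd)
```

The last line prints False: the failed lookup of "z" does not insert a key, which is the reason the C code calls get with a sentinel default instead of indexing the mapping.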

corona10 · 2021-08-29T16:17:24Z

Python/ceval.c

@@ -873,7 +873,8 @@ match_keys(PyThreadState *tstate, PyObject *map, PyObject *keys)
            }
            goto fail;
        }
-        PyObject *value = PyObject_CallFunctionObjArgs(get, key, dummy, NULL);
+        PyObject *args[] = { key, dummy };
+        PyObject *value = PyObject_Vectorcall(get, args, 2, NULL);


Replacing PyObject_CallFunctionObjArgs with PyObject_Vectorcall alone gives about a 2% speedup on the microbenchmark.

Fidget-Spinner · 2021-08-29T16:40:46Z

The changes LGTM. Tested locally on Win64:

python -m test test_patma -R 3:3
0:00:00 Run tests sequentially
0:00:00 [1/1] test_patma
beginning 6 repetitions
123456
......

== Tests result: SUCCESS ==

1 test OK.

BTW, I wonder whether using _PyObject_GetMethod instead of _PyObject_GetAttrId would make your benchmark faster. The diff against your current version is not too large:

@@ -846,7 +846,9 @@ match_keys(PyThreadState *tstate, PyObject *map, PyObject *keys)
     // - Don't cause key creation or resizing in dict subclasses like
     //   collections.defaultdict that define __missing__ (or similar).
     _Py_IDENTIFIER(get);
-    PyObject *get = _PyObject_GetAttrId(map, &PyId_get);
+    PyObject *get_name = _PyUnicode_FromId(&PyId_get); // borrowed
+    PyObject *get = NULL;
+    int meth_found = _PyObject_GetMethod(map, get_name, &get);
     if (get == NULL) {
         goto fail;
     }
@@ -873,8 +875,14 @@ match_keys(PyThreadState *tstate, PyObject *map, PyObject *keys)
             }
             goto fail;
         }
-        PyObject *args[] = { key, dummy };
-        PyObject *value = PyObject_Vectorcall(get, args, 2, NULL);
+        PyObject *args[] = { map, key, dummy };
+        PyObject *value = NULL;
+        if (meth_found) {
+            value = PyObject_Vectorcall(get, args, 3, NULL);
+        }
+        else {
+            value = PyObject_Vectorcall(get, &args[1], 2, NULL);
+        }
         if (value == NULL) {
             goto fail;
         }
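
The idea behind the meth_found branch can be illustrated in Python. When get is an ordinary method defined on the type, it can be called unbound with the mapping passed explicitly as the first argument, which avoids creating a temporary bound-method object. Otherwise the code falls back to a normal attribute lookup. The sketch below is only a rough analogue: the helper name call_get and the shadowing check are assumptions for illustration, not how _PyObject_GetMethod is implemented.

```python
import inspect
import types

# Types that behave like plain methods when found on a class.
_METHOD_TYPES = (types.FunctionType, types.MethodDescriptorType)


def call_get(mapping, key, default):
    """Call mapping.get(key, default), preferring an unbound call."""
    # Fetch the raw class attribute without invoking descriptors.
    raw = inspect.getattr_static(type(mapping), "get", None)
    # An instance attribute named "get" would shadow the method.
    shadowed = "get" in getattr(mapping, "__dict__", {})
    if isinstance(raw, _METHOD_TYPES) and not shadowed:
        # "Method found": pass the mapping as an explicit first argument,
        # like the three-argument PyObject_Vectorcall in the diff.
        return raw(mapping, key, default)
    # Fallback: ordinary lookup, called with two arguments only.
    return mapping.get(key, default)


missing = object()
print(call_get({"a": 1}, "a", missing))
print(call_get({"a": 1}, "b", missing) is missing)
```

The fallback path matters for objects whose get is a staticmethod or an instance attribute, where passing the mapping as an extra first argument would be wrong.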

corona10 · 2021-08-29T16:50:39Z

@Fidget-Spinner
Yeah it's better!


➜  cpython git:(bpo-45045) ✗ ./python.exe -m pyperf compare_to --table base.json suggestion.json
+---------------+--------+----------------------+
| Benchmark     | base   | suggestion           |
+===============+========+======================+
| bench pattern | 482 ns | 373 ns: 1.29x faster |
+---------------+--------+----------------------+
➜  cpython git:(bpo-45045) ✗ ./python.exe -m pyperf compare_to --table opt.json suggestion.json
+---------------+--------+----------------------+
| Benchmark     | opt    | suggestion           |
+===============+========+======================+
| bench pattern | 417 ns | 373 ns: 1.12x faster |
+---------------+--------+----------------------+

corona10 · 2021-08-29T16:53:51Z

With the new commit:

0:00:00 load avg: 5.05 Run tests sequentially
0:00:00 load avg: 5.05 [1/1] test_patma
beginning 6 repetitions
123456
......

== Tests result: SUCCESS ==

1 test OK.

Total duration: 1.1 sec
Tests result: SUCCESS

Fidget-Spinner

LGTM. Thanks!

corona10 · 2021-08-30T10:03:40Z

@Fidget-Spinner Thanks for the review.

Here is the final benchmark from an optimized build with thin LTO :)

+---------------+---------------+----------------------+
| Benchmark     | thin_lto_base | thin_lto_opt         |
+===============+===============+======================+
| bench pattern | 357 ns        | 287 ns: 1.24x faster |
+---------------+---------------+----------------------+

corona10 requested a review from markshannon as a code owner August 29, 2021 15:58

the-knights-who-say-ni added the CLA signed label Aug 29, 2021

bedevere-bot added the awaiting core review label Aug 29, 2021

corona10 added the skip news label Aug 29, 2021

corona10 requested a review from brandtbucher August 29, 2021 15:59

bpo-45045: Optimize mapping patterns of structural pattern matching

696d0bd

corona10 force-pushed the bpo-45045 branch from 183ba01 to 696d0bd Compare August 29, 2021 16:00

bpo-45045: Address code review

c95a7ea

bpo-45045 Add unbound method test case

7a30496

corona10 force-pushed the bpo-45045 branch from fd627dc to 7a30496 Compare August 29, 2021 17:43

corona10 requested a review from Fidget-Spinner August 29, 2021 17:47

bpo-45045: nit

71fe76d

Fidget-Spinner approved these changes Aug 30, 2021

bedevere-bot added awaiting merge and removed awaiting core review labels Aug 30, 2021

corona10 merged commit e6497fe into python:main Aug 30, 2021

bedevere-bot removed the awaiting merge label Aug 30, 2021

corona10 deleted the bpo-45045 branch August 30, 2021 10:04
