Pystats docs #650

mdboom · 2024-02-06T20:15:43Z

This is a first pass at trying to document the fields in the pystats. The most important ones (and the ones I'm most familiar with) are the Tier 2-related ones, so I've started with those and we can come back to the other ones later.

This probably belongs in CPython proper, but we can iterate here and submit there later.

gvanrossum

Good start!

pystats_docs.md

gvanrossum · 2024-02-06T23:08:12Z

pystats_docs.md

+
+## Specialization stats
+
+TBD


Can we ask @markshannon tp help fill these TBD entries?

For each family, there are three tables:

Kind

Lists the number of "deferred" (i.e. not specialized) instructions executed, the number of "hits" (specialized instructions that complete) and "misses" (specialized instructions that deopt).

Unnamed

Success: Number of specialization attempts that were successful
Failure: Number of specialization attempts that failed for some reason

Failure kind

Numbers for the various kinds of specialization failures.
The total should add up to the "Failure" entry in the above table.

~~@markshannon: For the Kind table, I see "specialization.deferred", "specialization.deopt", "hit" and "miss". What is the difference between "deopt" and "miss"?~~

EDIT: Nevermind -- I was looking at some old results.

@mdboom Can you add this to the doc?

pystats_docs.md

mdboom · 2024-02-07T15:30:00Z

It occurs to me that what we really ought to do is put this information in the generated output from summarize_stats.py. This PR is still a convenient way to iterate on the language though.

gvanrossum

A few more…

pystats_docs.md

Co-authored-by: Guido van Rossum <[email protected]>

gvanrossum · 2024-02-14T16:53:40Z

Blocked on @markshannon answering some inline questions.

markshannon · 2024-02-15T11:51:17Z

pystats_docs.md

+
+TBD
+
+## Specialization effectiveness


Specialization effectiveness

All entries are execution counts. Should add up to the total number of T1 instructions executed.
Basic: Instructions that are not and cannot be specialized, e.g. LOAD_FAST.
Not specialized: Instructions could be specialized but aren't. E.g. LOAD_ATTR, BINARY_SLICE
Specialized hits: Specialized instructions, e.g. LOAD_ATTR_MODULE that complete
Specialized misses: Specialized instructions, e.g. LOAD_ATTR_MODULE that deopt

I think there is a bug in the "Not specialized" numbers. We are counting POP_JUMP_IF_... but we shouldn't be.

Deferred by instruction

Breakdown of deferred (not specialized) instruction counts by family.

Misses by instruction

Breakdown of misses (specialized deopts) instruction counts by family.

I filed a CPython issue for the not specialized numbers problem: python/cpython#115521

markshannon · 2024-02-15T12:08:42Z

pystats_docs.md

+
+## Call stats
+
+TBD


This is shows what fraction of calls to Python functions are inlined (no call at the C level) and for those that are not, where the call comes from. The various categories overlap.

Also includes count of frame objects created.

Read the code for more details.

markshannon · 2024-02-15T12:12:37Z

pystats_docs.md

+
+## Object stats
+
+TBD


Grab bag of stats about objects:

"Allocations" means "allocations that are not from a freelist". Total allocations = "Allocations from freelist" + "Allocations"

"New values" is the number of values array created for objects with managed dicts.

The cache hit/miss numbers are for the MRO cache, split into dunder and other names.

markshannon · 2024-02-15T12:13:32Z

pystats_docs.md

+
+## GC stats
+
+TBD


By generation. Collected/visits gives some measure of efficiency

mdboom · 2024-02-15T14:53:16Z

I've incorporated @markshannon's content and will go ahead and merge (this is just a local reference -- we can always iterate).

gvanrossum · 2024-02-15T15:11:27Z

pystats_docs.md

+
+This is the count of how many times each Tier 1 instruction is executed.
+
+The "miss ratio" column shows the percentage of times when instruction executed that it deoptimized. In this case the base unspecialized instruction is not counted.


Maybe add "The deoptimization event is counted separately, see below".

Sorry I merged too quickly. Added here: #655

Based on this comment: #650 (comment)

mdboom requested review from gvanrossum and markshannon February 6, 2024 20:15

mdboom force-pushed the pystats-docs branch from b01408e to a3087cb Compare February 6, 2024 20:16

Add pystats docs

35572b2

mdboom force-pushed the pystats-docs branch from a3087cb to 35572b2 Compare February 6, 2024 20:16

gvanrossum reviewed Feb 6, 2024

View reviewed changes

mdboom added 2 commits February 7, 2024 10:14

Updates based on feedback

1a21ec9

Updates based on feedback

c674215

gvanrossum reviewed Feb 7, 2024

View reviewed changes

pystats_docs.md Outdated Show resolved Hide resolved

pystats_docs.md Outdated Show resolved Hide resolved

pystats_docs.md Outdated Show resolved Hide resolved

mdboom and others added 3 commits February 7, 2024 13:07

Update pystats_docs.md

04c56ee

Co-authored-by: Guido van Rossum <[email protected]>

Update pystats_docs.md

1d0b38a

Co-authored-by: Guido van Rossum <[email protected]>

Update pystats_docs.md

2f17739

Co-authored-by: Guido van Rossum <[email protected]>

mdboom mentioned this pull request Feb 12, 2024

gh-115362: Add documentation to pystats output python/cpython#115365

Merged

markshannon reviewed Feb 15, 2024

View reviewed changes

pystats_docs.md Outdated

## GC stats

TBD

Copy link

Member

markshannon Feb 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

By generation. Collected/visits gives some measure of efficiency

mdboom mentioned this pull request Feb 15, 2024

pystats specialization effectiveness numbers aren't correct python/cpython#115521

Open

Add information from @markshannon

b0659da

mdboom merged commit 99b09bb into faster-cpython:main Feb 15, 2024

gvanrossum reviewed Feb 15, 2024

View reviewed changes

mdboom added a commit that referenced this pull request Feb 15, 2024

Add cross reference from execution counts to specialization stats

fd994fa

Based on this comment: #650 (comment)

mdboom mentioned this pull request Feb 15, 2024

Add cross reference from execution counts to specialization stats #655

Merged

mdboom added a commit that referenced this pull request Feb 15, 2024

Add cross reference from execution counts to specialization stats (#655)

8bfd04b

Based on this comment: #650 (comment)


		This is the count of how many times each Tier 1 instruction is executed.

		The "miss ratio" column shows the percentage of times when instruction executed that it deoptimized. In this case the base unspecialized instruction is not counted.

Pystats docs #650

Pystats docs #650

Uh oh!

Conversation

mdboom commented Feb 6, 2024

Uh oh!

gvanrossum left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Kind

Unnamed

Failure kind

Uh oh!

mdboom Feb 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mdboom commented Feb 7, 2024

Uh oh!

gvanrossum left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gvanrossum commented Feb 14, 2024

Uh oh!

markshannon Feb 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Specialization effectiveness

Deferred by instruction

Misses by instruction

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mdboom commented Feb 15, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mdboom Feb 15, 2024 •

edited

Loading

markshannon Feb 15, 2024 •

edited

Loading