Skip to content

Commit

Permalink
pythongh-78214: marshal: Stabilize FLAG_REF usage (pythonGH-8226)
Browse files Browse the repository at this point in the history
Use FLAG_REF always for interned strings.

Refcounts of interned string is very unstable.
When compiling same source, refcounts of interned string in the output may be 1 or >1.
It makes FLAG_REF usage unstable.

To help reproducible build, use FLAG_REF for interned string even if refcnt(obj)==1.
  • Loading branch information
methane committed May 4, 2022
1 parent dfb1b9d commit 6dcfd6c
Show file tree
Hide file tree
Showing 3 changed files with 17 additions and 10 deletions.
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
``marshal.dumps()`` uses ``FLAG_REF`` for all interned strings. This makes
output more deterministic and helps reproducible build.
16 changes: 8 additions & 8 deletions Programs/test_frozenmain.h

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

9 changes: 7 additions & 2 deletions Python/marshal.c
Original file line number Diff line number Diff line change
Expand Up @@ -298,9 +298,14 @@ w_ref(PyObject *v, char *flag, WFILE *p)
if (p->version < 3 || p->hashtable == NULL)
return 0; /* not writing object references */

/* if it has only one reference, it definitely isn't shared */
if (Py_REFCNT(v) == 1)
/* If it has only one reference, it definitely isn't shared.
* But we use TYPE_REF always for interned string, to PYC file stable
* as possible.
*/
if (Py_REFCNT(v) == 1 &&
!(PyUnicode_CheckExact(v) && PyUnicode_CHECK_INTERNED(v))) {
return 0;
}

entry = _Py_hashtable_get_entry(p->hashtable, v);
if (entry != NULL) {
Expand Down

0 comments on commit 6dcfd6c

Please sign in to comment.