Updated memory management #543

samschott · 2024-11-19T22:26:50Z

This PR changes the memory management model of Rubicon. Previously, we would release objects on Python __del__ calls only if we created them ourselves and own them from alloc and similar calls.

In this PR, we always ensure that we own objects when we create a Python wrapper, by explicitly calling retain if we did not get them from alloc etc and always calling autorelease when the Python wrapper is garbage collected. This has a few advantages:

Users will no longer need to manually retain objects when receiving them from non-alloc methods and to release them before the Python object goes out of scope to avoid memory leaks.
Unless users manually release an object, Rubicon guarantees that a Python wrapper always points to an existing Objective-C object -- the one that it was created for.

This change should be backward compatible for most users because existing manual retain and release calls don't cause any issues if balanced and would have already caused segfaults if there are more releases than retains.

TODO:

Update docs.
Test with toga.
Figure out a decent transition plan for clients.

PR Checklist:

All new features have been tested
All new features have been documented
I have read the CONTRIBUTING.md file
I will abide by the code of conduct

samschott · 2024-11-19T22:35:00Z

@mhsmith, care to have first look? Please note the TODOs still listed above.

freakboy3742

This turned out to be a lot less invasive than I thought it would be.

I've done a check of the toga-chart issue that triggered this change; and it seems to resolve that problem.

I also did a quick check with Toga's testbed suite on macOS; that code has a bunch of manual retains and autorelease/releases. I thought they should all be balanced though - worst case, objects would be over-retained - so I was a little surprised that it the testbed segfaults almost immediately (and the stack trace doesn't give any obvious pointers what is causing the issue).

If I remove all the retains and releases, the testbed segfaults; but on inspection, some of the uses are for objects that are created in Python, then handed to ObjC to manage (e.g., Toolbar items created here), or the copyWithZone handler here). But I guess those uses of memory handling make sense - and they're a lot closer aligned to the "spirit" of ObjC memory handling, Plus, in at least the ToolbarItem case, it could be avoided by keeping the toolbar instance in the cache of items.

freakboy3742 · 2024-11-20T06:13:20Z

Related - if we land this, I suspect a version bump to 0.5 might be called for. This is just backwards incompatible enough that I think it's worth flagging the significance of the change.

samschott · 2024-11-20T09:43:37Z

Thanks for the thorough checks! I've updated the PR description to give a better summary of the change and also discuss why this should be non-breaking for most users.

I'll have a closer look at the segfaults that you encountered, later. They might be caused by the usage of release instead of autorelease in __del__ which does not give Objective-C a chance to take over ownership.

Maybe there are also ways to prevent users from shooting themselves in the foot, e.g., raise an exception on manual release calls if there is only a single reference left.

mhsmith · 2024-11-20T13:40:29Z

Thanks, this looks great. I'm busy today, but I'll take a look at this as soon as I can.

samschott · 2024-11-23T11:52:54Z

I've had a closer look now at the segfaults and could identify two cases where they happen:

The object is released by Rubicon but still needed in ObjC, for example because it is being assigned to a property. Replacing release by autorelease in __del__ fixes this by giving ObjC a chance to take over ownership.
Toga has a few "stray" release or autorelease calls sprinkled throughout the codebase to manually clean up memory, for example because of point (1). This relies on Rubicon internally setting _needs_release = False after those calls and disabling its own cleanup logic. This no longer works and results in one too many release calls now.

beeware/toga#2978 contains all the changes that I found to (1) prevent segfaults and (2) remove now unneeded manual memory management.

mhsmith

I've added #539 and #48 to the Fixes list in the top comment.

docs/how-to/memory-management.rst

src/rubicon/objc/api.py

changes/256.bugfix.rst

src/rubicon/objc/api.py

tests/test_core.py

Co-authored-by: Malcolm Smith <[email protected]>

changes/256.removal.rst

src/rubicon/objc/api.py

changes/256.bugfix.rst

tests/test_core.py

Co-authored-by: Malcolm Smith <[email protected]>

freakboy3742 · 2024-11-25T21:35:34Z

@samschott FYI - I'm on the last day of PyCon AU sprints; I might not get a chance to look at this until tomorrow when I'm back in my office for a return to semi-normal business.

samschott · 2024-11-25T21:47:17Z

No worries. I've gotten a quite a thorough review from @mhsmith in the meantime, but will wait for both of your approvals before merging. Enjoy PyCon AU!

… an additional refcount

samschott · 2024-11-25T22:34:50Z

src/rubicon/objc/api.py

+                # it here to prevent leaking memory, Python already owns a refcount from
+                # when the item was put in the cache.
+                if _returned_from_method.startswith(_OWNERSHIP_METHOD_PREFIXES):
+                    send_message(object_ptr, "release", restype=objc_id, argtypes=[])


I worry a bit that the mental load of following when we retain and release is a bit much now since the reader needs to think though the __new__ method being invoked multiple times with the same pointer.

Alternatives such as explicitly tracking a "Python refcount" might be easier to understand but would also add complexity.

Suggestions are welcome.

I'm fine with this approach: if it's possible to deal with the situation immediately after the copy, that's definitely easier to follow than maintaining an extra refcount.

I agree. An extra refcount starts to get into re-implementing garbage collection territory, and I'm not sure that's a game that is worth it. I'm comfortable with this as a known edge case, and documenting it with some related usage patterns and advice.

…om object

… is None

samschott · 2024-11-26T18:00:25Z

I've made one more change, to accommodate how copyWithZone: is often implemented on immutable objects such as NSString or NSDictionary where the original object is returned but with an additional retain count. When an already cached object is returned from copy and related methods, this retain count is now released immediately.

samschott · 2024-11-26T18:13:03Z

There is yet another hairy issue with the current implementation which will lead to memory leaks: alloc().init() chains where init changes the object's memory address, as documented at https://developer.apple.com/documentation/objectivec/nsobject/1418641-init?language=objc:

In some cases, a custom implementation of the init method might return a substitute object. You must therefore always use the object returned by init, and not the one returned by alloc or allocWithZone:, in subsequent code.

This is the case for example with NSDictionary.alloc().init() where the init returns a different object. The catch is that the object returned by init in this case seems to be owned by the caller ~~as well~~ (not ~~only~~ the object returned by alloc) and we start leaking memory. Example code with the implementation from this PR:

from rubicon.objc import *
from rubicon.objc.runtime import autoreleasepool


with autoreleasepool():
  adict = NSDictionary.alloc()
  idict = adict.initWithObjects(["some_object"], forKeys=["some_key"])

assert idict.retainCount() == 2  # Retained manually by Rubicon and implicitly.
print(adict)  # segaults because adict was deallocated

The above has two problems:

It leads to segfaults if the user attempts to use the allocated object after the autorelease pool was drained.
It leaks memory because the initialized object is retained twice but Rubicon is only aware of one of the retains.

Edit: Segfault seems to be for different reasons.

src/rubicon/objc/api.py

tests/test_core.py

src/rubicon/objc/api.py

Co-authored-by: Malcolm Smith <[email protected]>

samschott · 2024-11-26T20:32:49Z

Looks like my assessment from #543 (comment) was not quite correct:

The adict object does not seem to be deallocated, the print statement seems to segfault for other reasons.

mhsmith · 2024-11-26T21:05:33Z

NSDictionary.alloc().init() where the init returns a different object

The above has two problems:

It leads to segfaults if the user attempts to use the allocated object after the autorelease pool was drained.

It leaks memory because the initialized object is retained twice but Rubicon is only aware of one of the retains.

I guess then we'll also have to special-case methods starting with init. If init returns a different object to the one it was called on, then instead of creating a new ObjCInstance, replace the existing instance's pointer with the new one and return it.

The adict object does not seem to be deallocated, the print statement seems to segfault for other reasons.

I wouldn't worry too much about that object. It might be deallocated, or it might be a placeholder that gets recycled by every call to NSDictionary.alloc, but either way, we have no right to access it after calling init. As the init documentation says, "You must therefore always use the object returned by init, and not the one returned by alloc or allocWithZone:, in subsequent code."

freakboy3742 · 2024-11-27T04:27:58Z

There is yet another hairy issue with the current implementation which will lead to memory leaks: alloc().init() chains where init changes the object's memory address, as documented at https://developer.apple.com/documentation/objectivec/nsobject/1418641-init?language=objc:

Interesting... I wonder if this was the thing triggering #539...

NSDictionary.alloc().init() where the init returns a different object
The above has two problems:

It leads to segfaults if the user attempts to use the allocated object after the autorelease pool was drained.

It leaks memory because the initialized object is retained twice but Rubicon is only aware of one of the retains.

I guess then we'll also have to special-case methods starting with init. If init returns a different object to the one it was called on, then instead of creating a new ObjCInstance, replace the existing instance's pointer with the new one and return it.

To make sure we're on the same page, AIUI, your proposal is extra handling in ObjCBountMethod.__call__ that will be invoked if self.method.name.startswith("init"); this logic will:

check that self.receiver (the object on which the method is invoked) is the same value as the return value of the method; 2. If it isn't the same pointer, __call__ will modify the ObjCInstance instance itself, and the ObjCInstance cache to reflect the new value, releasing the "old" version from the alloc
If it is the same pointer, it does nothing and just returns the pointer as it normally would.

That sounds broadly reasonable to me; I have 2 questions/concerns:

Are there "init" methods that don't start with init? I can't think of any examples off the top of my head, but that might be my memory fading with age.
Is there a particular reason you suggest patching the old instance, rather than creating a new instance? It seems like this would be less complex to implement (possibly even the default behavior?), with the only downsides being the cost of creating a second Python object, and the fact that the alloc object and the init object aren't the same - but that mirrors the underlying ObjC behavior, so it strikes me as the sort of thing that can be documented.

freakboy3742 · 2024-11-27T05:14:02Z

I guess then we'll also have to special-case methods starting with init. If init returns a different object to the one it was called on, then instead of creating a new ObjCInstance, replace the existing instance's pointer with the new one and return it.

Something that just occurred to me in reviewing beeware/toga#2978 in the context of this change - does this also resolve the NSImage "init fail" issue? The return value from init will be different to the alloc'd object... so the alloc'd object should be released; the only catch is that we don't need to create the ObjCInstance because it's a None object.

freakboy3742

This is looking pretty good to me. Toga has an existing <0.5 pin, so we're safe to do a major update here without breaking Toga; and beeware/toga#2978 is a pretty clear "Delete all your manual retain/release" PR, except for one copy edge case, and the NSImage init failure case (which, AFAICT, will be solved if we address the "init may change address" issue.

freakboy3742 · 2024-11-27T04:37:59Z

src/rubicon/objc/api.py

+                cached_obj = cls._cached_objects[object_ptr.value]
+
+                # If a cached instance was returned from a call such as `copy` or
+                # `mutableCopy`, we take ownership of an additional refcount. Release


Flagging so it isn't forgotten - a link here to the NSCopying docs would be helpful, plus highlighting that copy can return the same object as an optimisation if the object is immutable.

freakboy3742 · 2024-11-27T05:08:13Z

src/rubicon/objc/api.py

+                # it here to prevent leaking memory, Python already owns a refcount from
+                # when the item was put in the cache.
+                if _returned_from_method.startswith(_OWNERSHIP_METHOD_PREFIXES):
+                    send_message(object_ptr, "release", restype=objc_id, argtypes=[])


I agree. An extra refcount starts to get into re-implementing garbage collection territory, and I'm not sure that's a game that is worth it. I'm comfortable with this as a known edge case, and documenting it with some related usage patterns and advice.

samschott added 3 commits November 19, 2024 23:44

Retain on ObjCInstance creation, autorelease on __del__

9abcd0d

update tests

6605842

add change note

931c352

samschott force-pushed the updated-memory-management branch from 89a4d62 to 931c352 Compare November 19, 2024 22:44

freakboy3742 reviewed Nov 20, 2024

View reviewed changes

freakboy3742 mentioned this pull request Nov 20, 2024

Force cache eviction when instance classes don't match. #540

Open

4 tasks

samschott added 3 commits November 20, 2024 21:11

use autorelease instead of release in __del__

a618f2a

code formatting

b1bf61c

update docs

21f2e0b

samschott marked this pull request as ready for review November 21, 2024 01:12

samschott added 2 commits November 23, 2024 12:42

add comment about autorelease vs release

20ab8f9

remove now unneeded cache staleness check

160c819

samschott mentioned this pull request Nov 23, 2024

Rubicon update beeware/toga#2978

Draft

remove stale instance cache tests

ce9d78c

mhsmith reviewed Nov 24, 2024

View reviewed changes

samschott and others added 7 commits November 24, 2024 21:42

update test_objcinstance_dealloc

6d89330

correct inline comment

3bb7ccc

Co-authored-by: Malcolm Smith <[email protected]>

make returned_from_method private

c0b091c

update ObjCInstance doc string

ab1f762

updated docs

544d694

update spellchecker

b4a1624

update change notes with migration instructions

f0edb5b

samschott force-pushed the updated-memory-management branch from 6648cd0 to f0edb5b Compare November 24, 2024 21:52

mhsmith reviewed Nov 25, 2024

View reviewed changes

changes/256.removal.rst Outdated Show resolved Hide resolved

src/rubicon/objc/api.py Outdated Show resolved Hide resolved

changes/256.bugfix.rst Outdated Show resolved Hide resolved

mhsmith reviewed Nov 25, 2024

View reviewed changes

tests/test_core.py Outdated Show resolved Hide resolved

tests/test_core.py Outdated Show resolved Hide resolved

tests/test_core.py Outdated Show resolved Hide resolved

samschott and others added 7 commits November 25, 2024 18:28

remove unneeded space in doc string

7d51fde

Co-authored-by: Malcolm Smith <[email protected]>

change bugfix to feature note

acfa546

Fix incorrect inline comment

18e08cc

Co-authored-by: Malcolm Smith <[email protected]>

trim trailing whitespace

532fbe0

update test comment

52e92c0

check that objects are not deallocated before end of autorelease pool

efed734

merge object lifecycle tests

ab8a895

samschott added 2 commits November 25, 2024 23:12

add a test case for copyWithZone returning the existing instance with…

30e4277

… an additional refcount

release additional refcounts by copy calls on the same ObjCInstance

c3a4fe1

samschott commented Nov 25, 2024

View reviewed changes

samschott added 5 commits November 26, 2024 10:01

rewrite the copy lifecycle test to use NSDictionary instead of a cust…

7bdc31f

…om object

prevent errors on ObjCInstance garbage collection when send_message…

460728b

… is None

switch copy lifecycle test to use NSString

d9c0f62

remove unused import

49d9381

fix spelling mistake

e0d7792

mhsmith reviewed Nov 26, 2024

View reviewed changes

src/rubicon/objc/api.py Outdated Show resolved Hide resolved

tests/test_core.py Outdated Show resolved Hide resolved

src/rubicon/objc/api.py Outdated Show resolved Hide resolved

samschott and others added 3 commits November 26, 2024 21:29

spelling updates

715912f

Co-authored-by: Malcolm Smith <[email protected]>

spelling updates

20e45b6

Co-authored-by: Malcolm Smith <[email protected]>

spelling updates

86b29a4

Co-authored-by: Malcolm Smith <[email protected]>

black code formatting

944328d

mhsmith mentioned this pull request Nov 26, 2024

Proposal for nicer init() #26

Open

freakboy3742 reviewed Nov 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated memory management #543

Updated memory management #543

samschott commented Nov 19, 2024 •

edited

Loading

samschott commented Nov 19, 2024

freakboy3742 left a comment

freakboy3742 commented Nov 20, 2024

samschott commented Nov 20, 2024 •

edited

Loading

mhsmith commented Nov 20, 2024

samschott commented Nov 23, 2024 •

edited

Loading

mhsmith left a comment

freakboy3742 commented Nov 25, 2024

samschott commented Nov 25, 2024

samschott Nov 25, 2024

mhsmith Nov 26, 2024

freakboy3742 Nov 27, 2024

samschott commented Nov 26, 2024

samschott commented Nov 26, 2024 •

edited

Loading

samschott commented Nov 26, 2024

mhsmith commented Nov 26, 2024 •

edited

Loading

freakboy3742 commented Nov 27, 2024

freakboy3742 commented Nov 27, 2024

freakboy3742 left a comment

freakboy3742 Nov 27, 2024

freakboy3742 Nov 27, 2024

Updated memory management #543

Are you sure you want to change the base?

Updated memory management #543

Conversation

samschott commented Nov 19, 2024 • edited Loading

PR Checklist:

samschott commented Nov 19, 2024

freakboy3742 left a comment

Choose a reason for hiding this comment

freakboy3742 commented Nov 20, 2024

samschott commented Nov 20, 2024 • edited Loading

mhsmith commented Nov 20, 2024

samschott commented Nov 23, 2024 • edited Loading

mhsmith left a comment

Choose a reason for hiding this comment

freakboy3742 commented Nov 25, 2024

samschott commented Nov 25, 2024

samschott Nov 25, 2024

Choose a reason for hiding this comment

mhsmith Nov 26, 2024

Choose a reason for hiding this comment

freakboy3742 Nov 27, 2024

Choose a reason for hiding this comment

samschott commented Nov 26, 2024

samschott commented Nov 26, 2024 • edited Loading

samschott commented Nov 26, 2024

mhsmith commented Nov 26, 2024 • edited Loading

freakboy3742 commented Nov 27, 2024

freakboy3742 commented Nov 27, 2024

freakboy3742 left a comment

Choose a reason for hiding this comment

freakboy3742 Nov 27, 2024

Choose a reason for hiding this comment

freakboy3742 Nov 27, 2024

Choose a reason for hiding this comment

samschott commented Nov 19, 2024 •

edited

Loading

samschott commented Nov 20, 2024 •

edited

Loading

samschott commented Nov 23, 2024 •

edited

Loading

samschott commented Nov 26, 2024 •

edited

Loading

mhsmith commented Nov 26, 2024 •

edited

Loading