Bug 1561 – AA's create many false references for garbage collector

Status
RESOLVED
Resolution
WORKSFORME
Severity
major
Priority
P2
Component
phobos
Product
D
Version
D2
Platform
x86
OS
Windows
Creation time
2007-10-09T14:44:00Z
Last change time
2015-06-09T05:10:41Z
Assigned to
nobody
Creator
wbaxter

Comments

Comment #0 by wbaxter — 2007-10-09T14:44:52Z
A program that uses a lot of AA's will leak memory. It looks like maybe the reason is that the aaA structs which contain the hash value are allocated as void[size], so the hash value is always interpreted as a pointer. Tangos version of aaA.d does it slightly differently, checking the size and setting the gc NO_SCAN bit if the key and value types can't hold a pointer: // Not found, create new elem //printf("create new one\n"); size_t size = aaA.sizeof + keysize + valuesize; uint bits = keysize < (void*).sizeof && keysize > (void).sizeof && valuesize < (void*).sizeof && valuesize > (void).sizeof ? BlkAttr.NO_SCAN : 0; e = cast(aaA *) gc_calloc(size, bits); Test case for leakage using AA's with phobos below. Not sure what this does with Tango, but I think it will probably still fail on a 32-bit architecture since int.sizeof is in the range of sizes that still gets scanned. It seems like a better approach would be to a) use keyti's TypeInfo to decide if it's a pointer type or not and b) arrange for _aaGet to be called with the value type's TypeInfo too, instead of just a size, and also use that to decide if the alloc'ed memory should have the NO_SCAN bit set or not. It could be that all of the above diagnostic guesswork is wrong. But I am certain that the program below leaks memory, whatever the actual reason. ---------------< test case >----------------------- import std.stdio; import std.gc; // Just an ordinary AA with a lot of values. // neither keys nor values look like pointers. class BigAA { int[int] aa; this() { for(int i=0;i<1000;i++) { aa[i] = i; } } } void main() { int nloops = 10_000; auto b = new BigAA[100]; for(int i=0; i<nloops; ++i) { // Create some AAs (overwriting old ones) foreach(ref v; b) { v = new BigAA; } // See how we're doing std.gc.GCStats stats; std.gc.fullCollect(); std.gc.getStats(stats); writefln("Loop %-5s - poolsize=%-10s %s Mbytes (%s KB)", i, stats.poolsize, stats.usedsize/1024/1024, stats.usedsize/1024); } }
Comment #1 by kamm-removethis — 2007-10-10T13:14:45Z
I compiled and ran the program against tango using dmd 1.020. Since there doesn't seem to be gcstats equivalent functionality, I removed it and monitored the memory usage with top. I terminated the process after eight minutes of it running: it consumed a constant and small amount of memory during its execution. With dmd 1.021 and phobos, I see the memory usage growing over time.
Comment #2 by wbaxter — 2007-10-10T20:04:26Z
Thanks for trying it out with Tango. But if Tango doesn't show the problem then that almost certainly means my diagnosis is incorrect.
Comment #3 by andy — 2015-01-24T15:24:16Z
tested on DMD v2.066.1 Also without gc.getStats, just watching top. No memory leak detected. It actually acts the same for me with or without the call to GC.collect(); As this report includes both D1 and D2, and its not an issue in D2, I'd recommend closing. My Test: import std.stdio; // Just an ordinary AA with a lot of values. // neither keys nor values look like pointers. class BigAA { int[int] aa; this() { for(int i=0;i<1000;i++) { aa[i] = i; } } } void main() { int nloops = 10_000; auto b = new BigAA[100]; for(int i=0; i<nloops; ++i) { // Create some AAs (overwriting old ones) foreach(ref v; b) { v = new BigAA; } //GC.collect(); } }
Comment #4 by schveiguy — 2015-01-26T12:30:35Z
Yeah, D2 uses the same runtime as Tango used once upon a time. So I think we can close this.