[R] Inserting 17M entries into env took 18h, inserting 34M entries taking 5+ days

Magnus Thor Torfason zulutime.net at gmail.com
Mon Nov 4 12:19:50 CET 2013

There are around 16M unique values. After accounting for equivalence, 
the number of distinct groups is much smaller (I don't know how much 
smaller, since my program has not completed yet :-).

Yes, I meant that "B and C are also equivalent". The original version 
was a typo.
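
To make the transitivity rule concrete: if A and B share a pair, and A and C share another pair, then A, B and C all end up in one group. One common way to compute such groups is a disjoint-set (union-find) structure. The sketch below is an illustration only, not the program discussed in this thread; it is written in Python rather than R, and the names `find`, `equivalence_classes` and the sample `pairs` are hypothetical.

```python
def find(parent, x):
    """Return the representative (root) of x, shortening the path as we go."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]  # path halving: point x at its grandparent
        x = parent[x]
    return x


def equivalence_classes(pairs):
    """Group values so that any two values linked by a chain of pairs share a group."""
    parent = {}
    for a, b in pairs:
        # Each value seen for the first time starts as its own root.
        parent.setdefault(a, a)
        parent.setdefault(b, b)
        root_a, root_b = find(parent, a), find(parent, b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two groups
    groups = {}
    for value in parent:
        groups.setdefault(find(parent, value), set()).add(value)
    return list(groups.values())


# Hypothetical sample data: A-B and A-C imply that B and C are equivalent too.
pairs = [("A", "B"), ("A", "C"), ("D", "E")]
classes = equivalence_classes(pairs)
```

With the sample pairs, A, B and C fall into one group and D and E into another, so five unique values reduce to two groups.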


On 11/1/2013 3:45 PM, jim holtman wrote:
> in the 20M pairs, how many unique values are there?  In your statement
> above " But equivalence is transitive, so if A and B occur together in
> one pair, and A and C occur together in another pair, then A and C are
> also equivalent.", did you mean that "B and C are also equivalent"?
> Jim Holtman
> Data Munger Guru

