homepage

This issue tracker has been migrated to GitHub , and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author mark.dickinson
Recipients christian.heimes, gregory.p.smith, mark.dickinson, vstinner
Date 2008年11月11日.16:30:50
SpamBayes Score 0.0040788488
Marked as misclassified No
Message-id <1226421060.69.0.087682076799.issue4258@psf.upfronthosting.co.za>
In-reply-to
Content
Here's a version of the 15-bit to 30-bit patch
that adds in a souped-up version of Mario Pernici's
faster multiplication.
I did some testing of 100x100 digit and 1000x1000 digit
multiplies. On 32-bit x86:
 100 x 100 digits : around 2.5 times faster
 1000 x 1000 digits : around 3 times faster.
On x86_64, I'm getting more spectacular results:
 100 x 100 digits : around 5 times faster
 1000 x 1000 digits: around 7 times faster!
The idea of the faster multiplication is quite simple:
with 30-bit digits, one can fit a sum of 16 30-bit by
30-bit products in a uint64_t. This means that the
inner loop for the basecase grade-school multiplication
contains one fewer addition and no mask and shift.
[Victor, please don't delete the old longdigit4.patch!]
History
Date User Action Args
2008年11月11日 16:31:00mark.dickinsonsetrecipients: + mark.dickinson, gregory.p.smith, vstinner, christian.heimes
2008年11月11日 16:31:00mark.dickinsonsetmessageid: <1226421060.69.0.087682076799.issue4258@psf.upfronthosting.co.za>
2008年11月11日 16:31:00mark.dickinsonlinkissue4258 messages
2008年11月11日 16:30:59mark.dickinsoncreate

AltStyle によって変換されたページ (->オリジナル) /