-
Committer:
Jacques Distler
-
Date:
2008年05月17日 06:43:11 UTC
-
Revision ID:
distler@golem.ph.utexas.edu-20080517064311-ppeemqnct2lzm2bc
Efficiency: Entity handling
Previously, used a regexp to find and convert named entities in the content.
Now use a more efficient algorithm.
Similar tweak for converting NCRs before checking whether text is valid utf-8.