
This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in Python's Developer Guide.

Author phr
Recipients phr
Date 2008-05-15.03:08:29
SpamBayes Score 0.032561366
Marked as misclassified No
Message-id <1210820919.45.0.733381289954.issue2857@psf.upfronthosting.co.za>
In-reply-to
Content
For object serialization and some other purposes, Java encodes Unicode
strings with a modified version of UTF-8:
http://en.wikipedia.org/wiki/UTF-8#Java
http://java.sun.com/javase/6/docs/api/java/io/DataInput.html#modified-utf-8
It is used in Lucene index files, among other places.
It would be useful if Python had a codec for this, maybe called "UTF-8J"
or something like that.
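
To make the request concrete, here is a minimal sketch of what such a codec would have to do, going by the format described at the links above: U+0000 is written as the two bytes C0 80, and characters outside the Basic Multilingual Plane are written as a UTF-16 surrogate pair with each surrogate encoded as three bytes. The function names are hypothetical and are not part of Python's codecs module. The sketch does little validation, and it does not handle the two-byte length prefix that Java's DataOutput.writeUTF puts in front of the encoded bytes.

```python
def encode_modified_utf8(text: str) -> bytes:
    """Encode str to Java-style modified UTF-8 (hypothetical helper)."""
    out = bytearray()
    # Go through UTF-16 first so that supplementary characters are
    # already split into surrogate pairs, as Java sees them.
    utf16 = text.encode("utf-16-be", "surrogatepass")
    for i in range(0, len(utf16), 2):
        unit = (utf16[i] << 8) | utf16[i + 1]
        if unit == 0:
            # U+0000 becomes an overlong two-byte form, never a raw NUL.
            out += b"\xc0\x80"
        elif unit < 0x80:
            out.append(unit)
        elif unit < 0x800:
            out.append(0xC0 | (unit >> 6))
            out.append(0x80 | (unit & 0x3F))
        else:
            # Includes surrogates: each one gets its own 3-byte sequence.
            out.append(0xE0 | (unit >> 12))
            out.append(0x80 | ((unit >> 6) & 0x3F))
            out.append(0x80 | (unit & 0x3F))
    return bytes(out)


def decode_modified_utf8(data: bytes) -> str:
    """Decode Java-style modified UTF-8 to str (hypothetical helper)."""
    units = bytearray()
    i = 0
    while i < len(data):
        b0 = data[i]
        if b0 < 0x80:
            unit = b0
            i += 1
        elif b0 & 0xE0 == 0xC0:
            unit = ((b0 & 0x1F) << 6) | (data[i + 1] & 0x3F)
            i += 2
        elif b0 & 0xF0 == 0xE0:
            unit = (((b0 & 0x0F) << 12)
                    | ((data[i + 1] & 0x3F) << 6)
                    | (data[i + 2] & 0x3F))
            i += 3
        else:
            # Four-byte sequences never occur in modified UTF-8.
            raise ValueError("invalid lead byte at offset %d" % i)
        units += unit.to_bytes(2, "big")
    # Reassemble surrogate pairs into supplementary characters.
    return bytes(units).decode("utf-16-be")
```

With these helpers, "A\x00" would encode to b"A\xc0\x80", and a character such as U+1F600 would take six bytes instead of the four that standard UTF-8 uses.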
History
Date                 User  Action  Args
2008-05-15 03:08:40  phr   set     spambayes_score: 0.0325614 -> 0.032561366
                                   recipients: + phr
2008-05-15 03:08:40  phr   set     spambayes_score: 0.0325614 -> 0.0325614
                                   messageid: <1210820919.45.0.733381289954.issue2857@psf.upfronthosting.co.za>
2008-05-15 03:08:38  phr   link    issue2857 messages
2008-05-15 03:08:37  phr   create
