| Author | phr |
|---|---|
| Recipients | phr |
| Date | 2008-05-15 03:08:29 |
| Message-id | <1210820919.45.0.733381289954.issue2857@psf.upfronthosting.co.za> |
| In-reply-to | |
| Content | |
|---|---|
For object serialization and some other purposes, Java encodes Unicode strings with a modified version of UTF-8:

http://en.wikipedia.org/wiki/UTF-8#Java
http://java.sun.com/javase/6/docs/api/java/io/DataInput.html#modified-utf-8

It is used in Lucene index files, among other places.

It would be useful if Python had a codec for this, maybe called "UTF-8J" or something like that.
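For reference, the modified encoding differs from standard UTF-8 in two ways: U+0000 is written as the two-byte sequence `C0 80`, and characters outside the BMP are written as a UTF-16 surrogate pair with each surrogate encoded separately as three bytes. Below is a minimal sketch of what such a codec might do, assuming Python 3. The function names `encode_modified_utf8` and `decode_modified_utf8` are hypothetical and not part of Python. The sketch covers only the payload bytes (not the two-byte length prefix that `DataOutput.writeUTF` emits) and does almost no validation of malformed input.

```python
def encode_modified_utf8(text: str) -> bytes:
    """Encode a str as Java-style modified UTF-8 (payload only, no length prefix)."""
    out = bytearray()
    # Work on UTF-16 code units, as Java does; "surrogatepass" lets lone
    # surrogates through instead of raising.
    units = text.encode("utf-16-be", "surrogatepass")
    for i in range(0, len(units), 2):
        u = (units[i] << 8) | units[i + 1]
        if 0x0001 <= u <= 0x007F:
            out.append(u)                          # one byte, same as ASCII
        elif u <= 0x07FF:
            # Two bytes; this branch also handles U+0000, giving C0 80.
            out.append(0xC0 | (u >> 6))
            out.append(0x80 | (u & 0x3F))
        else:
            # Three bytes; surrogates land here, each encoded on its own.
            out.append(0xE0 | (u >> 12))
            out.append(0x80 | ((u >> 6) & 0x3F))
            out.append(0x80 | (u & 0x3F))
    return bytes(out)


def decode_modified_utf8(data: bytes) -> str:
    """Decode Java-style modified UTF-8 (payload only) back to a str."""
    units = bytearray()
    i = 0
    while i < len(data):
        b0 = data[i]
        if b0 < 0x80:
            u = b0
            i += 1
        elif b0 & 0xE0 == 0xC0:
            u = ((b0 & 0x1F) << 6) | (data[i + 1] & 0x3F)
            i += 2
        elif b0 & 0xF0 == 0xE0:
            u = ((b0 & 0x0F) << 12) | ((data[i + 1] & 0x3F) << 6) | (data[i + 2] & 0x3F)
            i += 3
        else:
            # Four-byte sequences never occur in modified UTF-8.
            raise ValueError(f"invalid lead byte 0x{b0:02x} at offset {i}")
        units += u.to_bytes(2, "big")
    # Decoding as UTF-16 recombines surrogate pairs into single code points.
    return units.decode("utf-16-be", "surrogatepass")
```

With this sketch, `encode_modified_utf8("A\x00")` gives `b"A\xc0\x80"`, and `encode_modified_utf8("\U0001F600")` gives the six bytes `b"\xed\xa0\xbd\xed\xb8\x80"` rather than the four-byte standard UTF-8 form. A real codec would presumably be registered through the `codecs` module and would need proper error handling.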
| History | | | |
|---|---|---|---|
| Date | User | Action | Args |
| 2008-05-15 03:08:40 | phr | set | spambayes_score: 0.0325614 -> 0.032561366 recipients: + phr |
| 2008-05-15 03:08:40 | phr | set | spambayes_score: 0.0325614 -> 0.0325614 messageid: <1210820919.45.0.733381289954.issue2857@psf.upfronthosting.co.za> |
| 2008-05-15 03:08:38 | phr | link | issue2857 messages |
| 2008-05-15 03:08:37 | phr | create | |