Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Releases: arp242/uni

v2.9.0

16 Dec 22:19
@arp242 arp242
f66ba1d
This commit was signed with the committer’s verified signature.
arp242 Martin Tournoij
SSH Key Fingerprint: 2DHReRWdgRXpX3kSKVo4wU80KoRuqXGmv81g8KgrXbA
Verified
Learn about vigilant mode.

Choose a tag to compare

  • Update to Unicode 17.0.

  • Fix sorting of print and search with a custom -format flag (it sorted randomly if %(dec) wasn't included).

  • Sometimes the header would be printed on the second line when U+00 was included in the output

  • Recognise c:[cat-name] to print a category in addition to cat:[cat-name].

Assets 12
Loading

v2.8.0

11 Sep 00:06
@arp242 arp242
1d280da
This commit was signed with the committer’s verified signature.
arp242 Martin Tournoij
SSH Key Fingerprint: 2DHReRWdgRXpX3kSKVo4wU80KoRuqXGmv81g8KgrXbA
Verified
Learn about vigilant mode.

Choose a tag to compare

Update to Unicode 16.0.

Loading

v2.7.0

22 May 16:19
@arp242 arp242
481d298
This commit was signed with the committer’s verified signature.
arp242 Martin Tournoij
SSH Key Fingerprint: 2DHReRWdgRXpX3kSKVo4wU80KoRuqXGmv81g8KgrXbA
Verified
Learn about vigilant mode.

Choose a tag to compare

  • Improve -format flag:

    • Add %name as an alias for %(name l:auto); this is a lot less typing and requires less shell quoting, and >90% of the time this is what you want.

    • Automatically prepend character, codepoint, and name if the format flag starts with +; for example:

      % uni identify -f +'%unicode %plane' a
       Name Unicode Plane
      'a' U+0061 LATIN SMALL LETTER A 1.1 Basic Multilingual Plane
      

    This should make quickly printing some property a lot quicker.

  • Align and colourize JSON output.

  • Update CLDR information, adding significantly more aliases for emojis.

  • Add cells column, which returns how many cells a codepoint will display at (0, 1, or 2).

  • Add aliases column, which lists the alias names. Also add this to the default output:

    % uni s factorial
     CPoint Dec UTF8 HTML Name Aliases
    '!' U+0021 33 21 ! EXCLAMATION MARK [factorial, bang]
    
  • Add refs columns, which references other related/similar codepoints:

    % uni p -q U+46 -f '%(name): %(refs)'
    LATIN CAPITAL LETTER F: U+2109, U+2131, U+2132
    % uni p -q U+46 -f '%(refs)' | uni p
     CPoint Dec UTF8 HTML Name Aliases
    '°F' U+2109 8457 e2 84 89 ℉ DEGREE FAHRENHEIT
    'F' U+2131 8497 e2 84 b1 ℱ SCRIPT CAPITAL F [Fourier transform]
    'Ⅎ' U+2132 8498 e2 84 b2 Ⅎ TURNED CAPITAL F [Claudian digamma inversum]
    
  • Allow arguments to printto start or end with a comma or slash. This comes up when copy/pasting some list of codepoints from another source; there's no real reason to error out on this.

  • Allow listing unicode versions with uni list unicode and planes with uni list planes.

  • uni list without arguments errors, instead of listing all.

  • Add h format flag to not print the header for this column.

Loading
jwodder, epilys, and claui reacted with thumbs up emoji
3 people reacted

v2.6.0

24 Nov 23:39
@arp242 arp242
b25052b
This commit was signed with the committer’s verified signature.
arp242 Martin Tournoij
SSH Key Fingerprint: 2DHReRWdgRXpX3kSKVo4wU80KoRuqXGmv81g8KgrXbA
Verified
Learn about vigilant mode.

Choose a tag to compare

  • Update to Unicode 15.1.

  • Add "script" property – also supported in the list and print commands:

    % uni identify -f '%(script l:auto) %(cpoint) %(name)' 'a Ω'
    Script CPoint Name
    Latin U+0061 LATIN SMALL LETTER A
    Common U+0020 SPACE
    Greek U+03A9 GREEK CAPITAL LETTER OMEGA
    % uni list scripts
    Scripts:
    Name Assigned
    Adlam 83
    Ahom 54
    Anatolian Hieroglyphs 582
    ...
    % uni print 'script:linear a'
    Showing script Linear A
     CPoint Dec UTF8 HTML Name (Cat)
    '𐘀' U+10600 67072 f0 90 98 80 𐘀 LINEAR A SIGN AB001 (Other_Letter)
    '𐘁' U+10601 67073 f0 90 98 81 𐘁 LINEAR A SIGN AB002 (Other_Letter)
    '𐘂' U+10602 67074 f0 90 98 82 𐘂 LINEAR A SIGN AB003 (Other_Letter)
    ...
    
  • Add "unicode" property, which tells you in which Unicode version a codepoint was introduced:

    % uni identify -f '%(unicode l:auto) %(cpoint l:auto) %(name)' a𐘂🫁
    Unicode CPoint Name
    1.1 U+0061 LATIN SMALL LETTER A
    7.0 U+10602 LINEAR A SIGN AB003
    13.0 U+1FAC1 LUNGS
    
  • Show unprintable control characters as the open box (␣, U+2423) instead of the replacement character (�, U+FFFD). It already did that for C1 control characters, and U+FFFD looked more like a bug than intentional. The -raw/-r flag still overrides this.

  • Always print Private Use characters as-is for %(char) instead of using U+FFFD replacement character. It's usually safe to print this, and having to use -raw is confusing.

  • ls command is now an alias for `list.

Loading
ismail and z1nc0r3 reacted with hooray emoji
2 people reacted

v2.5.1

09 May 15:41
@arp242 arp242
f33796f
This commit was signed with the committer’s verified signature.
arp242 Martin Tournoij
SSH Key Fingerprint: 2DHReRWdgRXpX3kSKVo4wU80KoRuqXGmv81g8KgrXbA
Verified
Learn about vigilant mode.

Choose a tag to compare

  • Fix build on Go 1.17 and earlier.
Loading
svengreb reacted with thumbs up emoji
1 person reacted

v2.5.0

03 May 16:47
@arp242 arp242
4c9e955
This commit was signed with the committer’s verified signature.
arp242 Martin Tournoij
SSH Key Fingerprint: 2DHReRWdgRXpX3kSKVo4wU80KoRuqXGmv81g8KgrXbA
Verified
Learn about vigilant mode.

Choose a tag to compare

  • Add support for properties; they can be displayed with %(props) in -format, and selected in print (e.g. uni print dash).

  • Add uni list command, to list categories, blocks, and properties.

  • Allow explicitly selecting a block, category, or property in print with block:name (b:name), category:name (cat:name, c:name), or property:name (prop:name, p:name).

    Also print an error if a string without prefix matched more than one group (i.e. uni p dash matches both the property Dash and category Dash_Punctuation).

  • Add table layout with -as table. Also change -json/-j to -as json or -as j. The -json flag is still accepted as an alias for compatibility.

  • Change -q/-quiet to -c/-compact; -as json will print as minified if given, and -as table will include less padding. -q is still accepted as an alias for compatibility.

  • Don't use the Go stdlib unicode package; since this is a Unicode 13 database and some operations would fail on codepoints added in Unicode 14 due to the mismatch.

Loading
esdnm and subins2000 reacted with hooray emoji
2 people reacted

v2.4.0

20 Dec 18:36
@arp242 arp242
62da7a3
This commit was signed with the committer’s verified signature.
arp242 Martin Tournoij
SSH Key Fingerprint: 2DHReRWdgRXpX3kSKVo4wU80KoRuqXGmv81g8KgrXbA
Verified
Learn about vigilant mode.

Choose a tag to compare

  • Update import path to zgo.at/uni/v2.

  • Add oct and bin flags for -f to print a codepoint as octal or binary.

  • Add f format flag to change the fill character with alignment; e.g. %(bin r:auto f:0) will print zeros on the left.

  • Allow using just o123 for an octal number (instead of 0o123). We can't do this for binary and decimal numbers (since b and d are valid hexidecimals), but no reason not to do it for o.

Loading

Release v2.3.0

05 Oct 13:21
@arp242 arp242

Choose a tag to compare

  • Update to Unicode 14.0.

  • UTF-16 and JSON are printed as lower case, just like UTF-8 was. Upper-case is used only for codepoints (i.e. U+00AC).

  • uni print can now print from UTF-8 byte sequence; for example to print the € sign:

    uni p utf8:e282ac
    uni p 'utf8:e2 82 ac'
    uni p 'utf8:0xe2 0x82 0xac'
    

    Bytes can optionally be separated by any combination of 0x, -, _, or spaces.

Loading
atc0005 and olehub reacted with thumbs up emoji
2 people reacted

v2.2.1

15 Jun 02:33
@arp242 arp242

Choose a tag to compare

Only one small change:

You can now use uni p 0d40 to get U+28 by decimal.

uni print 40 interprets the 40 as hex instead of decimal, and there was no way to get a codepoint by decimal number. Since codepoints are much more more common than decimals, leaving off the U+ and U is a useful shortcut I'd like to keep. AFAIK there isn't really a standard(-ish) was to explicitly indicate a number is a decimal, so this is probably the closest.

Loading

v2.2.0

05 Jun 15:39
@arp242 arp242

Choose a tag to compare

Very minor bugfix release:

  • Make proper use of the /v2 import path so that go get and go install work. (#26)

  • Don't panic if -f doesn't contain any formatting characters.

Loading
Previous 1
Previous

AltStyle によって変換されたページ (->オリジナル) /