Edit - Code Golf Stack Exchange

You are not logged in. Your edit will be placed in a queue until it is peer reviewed.

We welcome edits that make the post easier to understand and more valuable for readers. Because community members review edits, please try to make the post substantially better than how you found it, for example, by fixing grammar or adding additional resources and hyperlinks.

Required fields*

Rev

Required fields*

Who needs 8 bits for one character?

Given a string and the characters used to encode it, you need to compress the string by only using as many bits as each character needs. You will return the character codes for each character needed to create a compressed string.

For example, given the string "the fox" and the encoder characters " abcdefghijklmnopqrstuvwxyz", the output should be [170, 76, 19, 195, 32].

How, though?

First, you need to map each encoder character to some bits. If we have the encoder characters abc, then we can map the characters to bits, by mapping the character to the position of the character in binary, like this:

a => 01
b => 10
c => 11

With 13579, we would map it like this:

1 => 001
3 => 010
5 => 011
7 => 100
9 => 101

Note that we pad zeros at the beginning as many as necessary.

Next, we would go through the string, and for each character, we would get the corresponding bits for that character. Then join all the bits together, and then convert to chunks of 8 to get the bytes. If the last byte is not 8 bits long, add zeros at the end till it is 8 bits long. Lastly, convert each byte to its decimal representation.

Reverse challenge is here.

Test cases

String: "the fox", encoder characters: " abcdefghijklmnopqrstuvwxyz" => [170, 76, 19, 195, 32]
String: "971428563", encoder characters: "123456789" => [151, 20, 40, 86, 48]
String: "the quick brown fox jumps over the lazy dog", encoder characters: " abcdefghijklmnopqrstuvwxyz" => [170, 76, 25, 89, 68, 96, 71, 56, 97, 225, 60, 50, 21, 217, 209, 160, 97, 115, 76, 53, 73, 130, 209, 111, 65, 44, 16]
String: "abc", encoder characters: "abc" => [108]
String: "aaaaaaaa", encoder characters: "a" => [255]
String: "aaaabbbb", encoder characters: "ab" => [85, 170]

Rules

Inputs can be a string, list, or even list of character codes. It doesn't matter, I/O is very flexible for this challenge.
Input will always be valid, e.g. the string will never include characters not in the encoder characters, etc.
Encoder characters will always contain **less than 256 characters.
Neither input will ever be empty.
This is code-golf, so the shortest answer in bytes for each language wins.
Standard I/O rules apply.
Default loopholes are forbidden.

Reference implementation in JavaScript

function encode(str, encoderChars) {
 const maxBitCount = Math.ceil(Math.log2(encoderChars.length + 1));
 const charToBit = Object.fromEntries(encoderChars.map((c, i) => [c, (i + 1).toString(2).padStart(maxBitCount, "0")]));
 const bits = [...str].map((c) => charToBit[c]).join("");
 const bytes = bits.match(/.{1,8}/g) || [];
 return bytes.map((x) => parseInt(x.padEnd(8, '0'), 2));
}

Attempt This Online!

Answer*

# [Jelly], 21 bytes

 JBUz0ZU⁸,yFŻ8¡s8z0ZḊḄ

[Try It Online!](https://jht.hyper-neutrino.xyz/tio#WyIiLCJKQlV6MFpV4oG4LHlGxbs4wqFzOHowWuG4iuG4hCIsIiIsIiIsWyIgYWJjZGVmZ2hpamtsbW5vcHFyc3R1dnd4eXoiLCJ0aGUgZm94Il1d)

[Jelly]: https://github.com/DennisMitchell/jellylanguage

```
JBUz0ZU⁸,yFŻ8¡s8z0ZḊḄ Main Link
J Generate [1, 2, ...] as long as the encoding list
 B Convert to binary
 Uz0ZU Reverse, transpose padding with 0, transpose, reverse (*)
 ⁸, Pair the encoding list with the padded binary list
 y Use this to translate the message into binary strings
 F Flatten
 Ż8¡ Prepend 0 8 times (in case the list is shorter than 8)
 s8 Slice into blocks of 8
 z0Z Transpose padding with 0, transpose (**)
 Ḋ Remove the block of 8 zeroes at the start
 Ḅ Convert from binary into numbers

(*) this is a common pattern used to left-pad everything to the same length
(**) this is a common pattern used to right-pad everything to the same length -
 because we prepended a block of 8 zeroes, there is at least one sublist
 of length 8, so everything will be padded to length 8
```

If this is an answer to a challenge…

…Be sure to follow the challenge specification. However, please refrain from exploiting obvious loopholes. Answers abusing any of the standard loopholes are considered invalid. If you think a specification is unclear or underspecified, comment on the question instead.
…Try to optimize your score. For instance, answers to code-golf challenges should attempt to be as short as possible. You can always include a readable version of the code in addition to the competitive one. Explanations of your answer make it more interesting to read and are very much encouraged.
…Include a short header which indicates the language(s) of your code and its score, as defined by the challenge.

More generally…

…Please make sure to answer the question and provide sufficient detail.
…Avoid asking for help, clarification or responding to other answers (use comments instead).

Draft saved

Draft discarded

Edit Summary*

Cancel

\$\begingroup\$ Enumerating all of the encodings then translating is a brilliant way to make sure they're all the right length! I was trying iⱮ;L{’ lmao \$\endgroup\$

Unrelated String
– Unrelated String

2022年05月06日 21:27:01 +00:00
Commented May 6, 2022 at 21:27

Add a comment |

How to Edit

Correct minor typos or mistakes
Clarify meaning without changing it
Add related resources or links
Always respect the author’s intent
Don’t use edits to reply to the author

How to Format

create code fences with backticks ` or tildes ~
```
like so
```
add language identifier to highlight code
```python
def function(foo):
print(foo)
```
put returns between paragraphs
for linebreak add 2 spaces at end
_italic_ or **bold**
indent code by 4 spaces
backtick escapes `like _so_`
quote by placing > at start of line
to make links (use https whenever possible)

<https://example.com>

[example](https://example.com)

<a href="https://example.com">example</a>
MathJax equations \$\sin^2 \theta\$

formatting help »
answering help »

MathJax help »

How to Tag

A tag is a keyword or label that categorizes your question with other, similar questions. Choose one or more (up to 5) tags that will help answerers to find and interpret your question.

complete the sentence: my question is about...
use tags that describe things or concepts that are essential, not incidental to your question
favor using existing popular tags
read the descriptions that appear below the tag

If your question is primarily about a topic for which you can't find a tag:

combine multiple words into single-words with hyphens (e.g. code-golf), up to a maximum of 35 characters
creating new tags is a privilege; if you can't yet create a tag you need, then post this question without it, then ask the community to create it for you

popular tags »