Since I'm new to ROLZ, the time performance is not good at all. But it seems to achieve a better compression ratio than a normal LZ77 compressor!
http://comprox.googlecode.com/files/...z-0.1.0.tar.gz
Code:
world95.txt  3005020 =>  557787
bible.txt    4047392 =>  811732
fp.log      20617071 =>  681905
For compiling, my MinGW needs crblib, but the one provided by Charles Bloom doesn't seem to work:
dpcm.c:63:24: fatal error: crbinc/inc.h: No such file or directory
compilation terminated.
imppm.c:8:24: fatal error: crbinc/inc.h: No such file or directory
compilation terminated.
C:\MinGW\bin>gcc -O3 *.c
In file included from dpcm.c:63:0:
c:\mingw\bin\../lib/gcc/mingw32/4.6.2/../../../../include/crbinc/inc.h:23:28: fatal error: crblib/memutil.h: No such file or directory
compilation terminated.
In file included from imppm.c:8:0:
c:\mingw\bin\../lib/gcc/mingw32/4.6.2/../../../../include/crbinc/inc.h:23:28: fatal error: crblib/memutil.h: No such file or directory
compilation terminated.
I compiled with no problem using "gcc -O3 *.c" with MinGW 4.6.1 (Win32). Test results:
http://mattmahoney.net/dc/text.html#2158
http://mattmahoney.net/dc/silesia.html
I guess you might need pthreadGC2.dll to run.
I have also compiled it now. Thanks for the hint.
Thanks for the benchmark.

Quote Originally Posted by Matt Mahoney:
I compiled with no problem using "gcc -O3 *.c" with MinGW 4.6.1 (Win32). Test results:
http://mattmahoney.net/dc/text.html#2158
http://mattmahoney.net/dc/silesia.html
I guess you might need pthreadGC2.dll to run.
I use native threading APIs on Windows, so pthreadGC2.dll is no longer needed. (I successfully compiled and ran it with MinGW32 and Wine under Linux.)
I found that all my programs have poor decompression performance. I think I'm using too long a context for decoding literals.
Should I pay more attention to optimal parsing and reduce the context length?
I didn't look at the source, but ROLZ decompresses slower than LZ77 because the decompressor has to maintain an index.
@RichSelian:
"I use native threading APIs on windows, so pthreadGC2.dll is no longer needed."
Can you please post a Win32 binary here in the forum?
I want to compare the program with the open-source program BALZ (ROLZ compression) from encode.
The compression ratio seems to be not bad...
"bad performance on decompression"
maybe because your compression-algorithm can use 2 cores but your decompression-algorithm can use only 1 core?
best regards
Joerg
Compiled with "gcc -O3 *.c" in MinGW 4.6.1 for Win32, packed with UPX.
I use an o2-o1 model to encode literals; it's the main reason decompression is slow, but it is necessary for a good compression ratio.
I don't understand why some LZ compressors (like xz) can achieve as good a compression ratio while using only an o1 model.
It seems that high-order modeling of literals is not needed because high-order redundancy would be coded as matches instead. Also, some algorithms like LZMA use literal exclusion after matches. The first byte after a match would be poorly predicted by a model (or else it would have extended the match), so it is XORed with the predicted byte.
That means you have to search for len=3 matches? But will replacing 3 literals with a pos/len pair really help the compression ratio? (Maybe you are using some optimal-parsing tricks to limit the match pos?)

Quote Originally Posted by Matt Mahoney:
It seems like high order modeling of literals is not needed because they would be coded as matches instead. Also, some algorithms like LZMA use literal exclusion after matches. The first byte after a match would be poorly predicted by a model or else it would have extended the match, so it XORs it with the predicted byte.
Length-3 matches aren't worth coding because they will be common even in random files with a 16 MB window, resulting in no compression. What I mean is that if you use a context model to predict literals, it will usually make a wrong prediction after a match. Your model needs to account for this.