The Art
of Lossless
Data Compression
vol. 25t
Here are the results of tests performed in August 2002 to compare
lossless compression of "plain" texts by all known, sufficiently good programs
developed for this purpose, including UHArc, PPMd, Bzip2, RAR, ACE and 7-zip.
See the Archive Comparison Test by J.Gilchrist for more details:
http://compression.ca
If anybody wants to start or continue such tests,
or can suggest other sets of texts or other compression programs
(executable programs only, not sources or algorithm descriptions),
or knows that we have missed something important
(some fantastic new technology, an algorithm, or even a program capable
of lossless compression of up to 1000:1, etc.),
please let us know immediately: artest@inbox.ru Thank you!
[[1]] COMPRESSION QUALITY
=========================
(see also
[[2]] Speed
[[3]] Details
[[4]] Comments)
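Each line is one set of texts; the last (seventh) line shows results for the
sum of all 1231 texts in the six sets. Values are compressed sizes as a
percentage of the best result in the line (100%); Origin is the uncompressed size.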
 Origin  Entropy  Durilca  Compressia  PPMonstr     EPM    PPMN    Slim  Paq1SSE     BEE
553.31%     100%   100.06      100.45    101.76  102.57  105.51  101.71   102.72  107.19
543.05%     100%   103.57      103.68    108.50  108.01  110.03  111.16   109.65  112.67
435.56%     100%   104.58      103.74    107.96  106.88  107.05  107.18   108.02  111.37
492.76%     100%   105.47      106.15    110.61  110.32  110.13  110.90   111.77  114.63
806.27%   102.70     100%      107.91    101.67  105.66  116.95  109.37   110.90  112.44
360.54%   102.11     100%      108.90    104.66  105.39  103.20  104.25   114.17  107.39
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
468.44%     100%   102.79      105.63    107.25  107.35  107.48  107.68   110.72  111.24
  PPMd    PPMy      RK     RAR   UHArc      DC     SBC   BZip2   7-zip   pkzip
107.64  105.50  105.64  108.58  105.31  108.66  108.98  124.06  152.15  159.23
113.63  112.82  112.26  113.72  112.70  114.28  115.55  130.92  170.89  178.03
111.94  110.06  110.78  112.57  110.83  111.19  112.57  124.48  156.43  163.20
115.48  114.41  114.58  116.10  115.54  115.77  116.82  131.87  167.87  174.67
110.02  133.75  111.48  121.00  118.71  122.51  119.10  150.99  198.80  207.15
107.84  107.79  112.30  108.42  115.47  114.02  112.63  119.31  145.92  151.56
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
111.76  112.11  112.23  112.90  113.88  114.02  114.27  127.59  161.00  167.57
Results of some other programs are given only in the full version, in the TEXTS.DAT file.
[[2]] Speed
===========
The Canterbury Corpus Large Set, http://corpus.canterbury.ac.nz/resources/large.zip ,
was used for this test, on a 970MHz PC with 256MB RAM running Windows 98.
Columns: program and options; compression time, seconds; extraction time,
seconds; Overall Score, seconds and %; Average Users' Score, seconds and %;
compressed size, bytes and %.
no compression 0 0 4446 559 4446 577 16005619 600
ace32 a -d4096 66 2 1124 141 1058 137 3801917 142
ace32 a -d4096 -m1 31 2 1134 143 1104 143 3965841 149
ace32 a -d4096 -m5 206 2 1249 157 1045 136 3746553 140
arh a 38 40 1091 137 1053 137 3647067 137
arh a -2 -1 68 40 1121 141 1054 137 3647067 137
ba -k -50 35 12 964 121 929 121 3298943 124
bix a -mdg -s 92 1 1069 134 978 127 3514944 132
boa -m1 86 88 1253 158 1168 152 3886863 146
boa -m15 139 141 1165 146 1027 133 3182739 119
boa -m15 -s 138 140 1148 144 1011 131 3132810 117
bzip2 -k 21 6 1032 130 1011 131 3616113 136
bzip2 -k -9 20 6 1031 130 1011 131 3616113 136
Entropy t o12 94 95 1003 126 910 118 2932445 110
Entropy t o16 98 99 1001 126 904 117 2892711 108
Entropy t o32 105 106 1009 127 905 118 2873677 108
Entropy t o64 112 111 1022 128 911 118 2873318 108
compcl c -b15 37 20 904 114 868 113 3049569 114
compcl c -b15 -s 38 29 808 102 770 100 2668128 100
dc e 13 7 903 114 890 116 3179173 119
dc e -b16300 -mt5 17 7 795 100 778 101 2773427 104
eri a 39 17 936 118 897 116 3168414 119
eri a -m3 59 21 996 125 937 122 3295385 124
eri a -m6 59 21 989 124 931 121 3272926 123
gcac a 26 12 980 123 954 124 3390603 127
gcac s 26 12 981 123 955 124 3395064 127
imp98 a -mm 31 1 1175 148 1143 148 4112387 154
imp98 a -mm -2 13 5 999 126 986 128 3533761 132
imp98 a -2 -s4 13 5 999 126 986 128 3533693 132
pkzip -es 1 1 1654 208 1652 215 5945622 223
pkzip -a 4 1 1308 164 1304 169 4691491 176
pkzip -exx 16 1 1296 163 1280 166 4605942 173
ppmdi e -o7 -m232 11 12 904 114 893 116 3169000 119
ppmdi e -o12 -m232 25 26 915 115 891 116 3113630 117
ppmdi e -o16 -m232 27 28 916 115 890 116 3100943 116
ppmn_km e -o6 -MT1 30 30 931 117 901 117 3132278 117
ppmn_km e -o8 -MT1 64 65 993 125 929 121 3107654 116
ppmn_km e -o9 62 63 990 125 929 121 3115560 117
ppmn_km e -o9 -M:50 49 50 949 119 900 117 3058436 115
ppmonstr e -o7 -m232 64 67 974 123 911 118 3035498 114
ppmonstr e -o8 -m232 71 74 980 123 910 118 3007964 113
ppmonstr e -o64 -m232 101 103 1020 128 920 119 2937387 110
qlfc a 22 11 973 122 952 124 3385084 127
rk -mf2 50 20 1108 139 1058 137 3735704 140
rk -mx1 144 143 1147 144 1004 130 3093640 116
rk -mx2 173 173 1203 151 1032 134 3086312 116
sbc c -b63 29 9 914 115 885 115 3151930 118
sbc c -os -b63 29 9 810 102 782 101 2779632 104
szip -o4 4 10 1027 129 1023 133 3647445 137
szip -o6 17 14 996 125 979 127 3475264 130
szip -o8 -b41 27 17 973 122 947 123 3348344 125
zzip a 21 11 977 123 956 124 3400243 127
zzip a -mx 22 12 973 122 952 124 3383060 127
zzip a -mx -30m 30 12 940 118 910 118 3233147 121
7za a -t7z 84 1 1112 139 1036 134 3694393 138
7za a -t7z -mx 111 1 1110 139 1010 131 3591954 134
7za a -tzip 23 2 1246 156 1225 159 4393623 164
7za a -tzip -mx 45 1 1268 159 1227 159 4401160 164
abc13 -c 20 9 950 119 931 120 3313820 124
abc24 -c 29 16 923 116 897 116 3159570 118
bee a -m1 72 71 1061 133 996 129 3303235 123
bee a -m2 142 137 1156 145 1027 133 3153498 118
bee a -m3 199 193 1253 157 1074 139 3100189 116
bee a -m3 -d6 175 169 1205 151 1048 136 3100537 116
bee a -m3 -s 340 356 1567 197 1261 163 3133527 117
durilca e -o10 -t2(30) 82 82 982 123 907 117 2941278 110
durilca e -o12 -t2(30) 85 85 983 123 906 117 2923546 109
durilca e -o16 -t2(30) 87 87 984 123 905 117 2912973 109
durilca e -o32 -t2(30) 90 90 988 124 907 117 2910279 109
durilca e -o128 -t2(30) 92 91 992 124 909 118 2910571 109
epm7 c012 144 142 1131 142 1001 129 3037532 113
epm7 c016 147 146 1138 143 1005 130 3038383 113
paq1 277 275 1410 177 1161 150 3088369 115
paq1sse 360 360 1550 194 1226 159 2988475 112
ppmy70 /o6 /m220 260 269 1454 182 1219 158 3327472 124
ppmy70 /o7 /m220 294 301 1484 186 1219 158 3199428 119
ppmy70 /o8 /m220 321 329 1514 190 1224 159 3108922 116
ppmy70 /o9 /m220 348 355 1549 194 1236 160 3046316 114
ppmy70 /o16 /m220 360 366 1810 227 1485 192 3899871 146
rar a -m1 13 1 1239 155 1227 159 4408290 165
rar a -m3 35 1 1155 145 1123 145 4026937 150
rar a -m5 24 16 920 115 898 116 3164814 118
rar a -m5 -s 24 16 922 116 900 116 3173148 118
rar a -mc16t -s 35 26 991 124 959 124 3347746 125
rar a -mc16t+ -s 35 26 991 124 959 124 3347746 125
rar a -mc16:128t -s 40 31 955 120 918 119 3180234 119
rar a -mc16:128t+ -s 40 31 955 120 918 119 3180234 119
rar32 a -mc16t -s 37 29 996 125 962 125 3347746 125
slim a -d32 -w21 555 552 1928 242 1428 185 2952807 110
slim a -d16 -w21 552 549 1922 241 1425 185 2952991 110
slim a -d8 -w21 537 534 1892 237 1408 182 2953730 110
slim a -d4 -w21 485 482 1788 224 1351 175 2954750 110
uharc a -m1 -md32768 63 5 1026 129 969 125 3446069 129
uharc a -m2 -md32768 100 5 980 123 890 115 3151572 118
uharc a -m3 -md32768 110 5 973 122 874 113 3087249 115
uharc a -mz -md32768 8 9 1084 136 1077 139 3842041 144
uharc a -mx -md32768 60 55 936 117 882 114 2953184 110
ybs -m1m 22 8 952 119 931 120 3316356 124
ybs -m2m 25 8 937 117 915 118 3255538 122
ybs -m4m 28 8 919 115 894 116 3178183 119
ybs -m8m 31 8 905 113 877 113 3116271 116
ybs -m16mu 33 9 835 105 805 104 2852642 106
ybs -m16mu -r 34 9 841 105 811 105 2874130 107
ybs_d -m16mu 34 9 836 105 805 104 2852642 106
The Overall Score is calculated by adding the compression time, the extraction
time, and the time it would take to transfer the compressed file over a
28,800 bps network (28,800 bits per second is 3600 bytes per second):
(compressed_size)/3600
The Average Users' Score is calculated by adding (compress_time/10) + extract_time +
the time it would take to transfer the compressed file over a 28,800 bps network.
Compression time is divided by 10 here because more than 90% of people will
never compress anything in their lives (with compression programs), but they
use compressed data almost _every_ time they use computers and/or the Internet.
That is why compression time matters much less to them.
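The two score formulas can be sketched in a few lines of Python. This is only
an illustration of the arithmetic described above; the function and constant
names are made up for this example and do not come from the ARTest scripts.
It assumes 8 bits per byte and no protocol overhead on the 28,800 bps link.

```python
# Illustrative sketch of the two score formulas; all names are hypothetical.

BYTES_PER_SECOND = 28_800 // 8  # 28,800 bps link, 8 bits per byte -> 3600 bytes/s


def transfer_time(compressed_size):
    """Seconds needed to send compressed_size bytes over the 28,800 bps link."""
    return compressed_size / BYTES_PER_SECOND


def overall_score(compress_time, extract_time, compressed_size):
    """Compression time + extraction time + transfer time, in seconds."""
    return compress_time + extract_time + transfer_time(compressed_size)


def average_users_score(compress_time, extract_time, compressed_size):
    """Same sum, but compression time counts for only one tenth."""
    return compress_time / 10 + extract_time + transfer_time(compressed_size)


# The "no compression" row: 0 s, 0 s, 16005619 bytes.
print(round(overall_score(0, 0, 16005619)))        # 4446
print(round(average_users_score(0, 0, 16005619)))  # 4446
```

For the "no compression" row both functions give about 4446 seconds, as in
the table. The times in the table are whole seconds, so for other rows a
recomputed score may differ from the printed one by about a second.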
[[3]] Details
=============
Detailed results are no longer included in this main text
(they take thousands of lines, reporting about 100,000 results on 1231 files in 6 sets),
but can be found in the FULL version, with TEXTS.DAT and *.BAT,
at http://compression.ru/artest/artest25.zip
or http://artest1.tripod.com/artest25.zip
[[4]] Comments
==============
Links to download programs:
~~~~~~~~~~~~~~~~~~~~~~~~~~~
PPMD var.I,
PPmonstr v.I :W http://compression.ru/ds/ppmdi1.rar
Durilca 0.1a :W http://compression.ru/ds/durilca.rar
PAQ1SSE :W http://compression.ru/so/paq1sse.zip
EPM 7 :W http://compression.ru/so/epm_7.zip
YBS 0.03f :e http://compression.ru/ybs/ybs003fd.zip
YBS 0.03f :W http://compression.ru/ybs/ybs003fw.zip
BEE 0.7.6 :W http://compression.ru/fa/files2/bee076d.rar
PPMN_km b4 :W http://compression.ru/ms/ppmn_km.rar
PPMY 3c+sse :W http://compression.ru/sh/ppmy_3c_sse.rar
ERI 5.1fre :e http://compression.ru/artest/eri51fre.zip
7-Zip 2.30b32 :W http://www.7-zip.org/dl/7z230b32.exe
WinRAR 3.20 :W http://www.rarlab.com/rar/wrar320.exe
RAR32 3.20 :e http://www.rarlab.com/rar/rarx320.exe
Bzip2 1.0.2 :W ftp://sources.redhat.com/pub/bzip2/v102/bzip2-102-x86-win32.exe
ABC 1.3 :W http://www.data-compression.info/ABC/abc_13.zip
ABC 2.4 :W http://www.data-compression.info/ABC/abc_24.zip
ACB 2.00c :e ftp://ftp.simtel.net/pub/simtelnet/msdos/compress/acb_200c.zip
ACE 2.04 :W http://winace.host.sk/ace204.exe
ArHanGeL 1.40 :a http://geocities.com/SiliconValley/Lab/6606/arh140.zip
BA 1.01b5 :e http://hem.spray.se/mikael.lundqvist/ba101br5.zip
BOA 0.58b :e ftp://ftp.elf.stuba.sk/pub/pc/pack/boa058.zip
Compressia 1.0b :W http://www.compressia.com/compressia.exe
DC 0.98b :W ftp://ftp.elf.stuba.sk/pub/pc/pack/dc124.zip
GCac 0.9k :W http://www.emit.jp/gca/gca_v09k.exe
Imp 1.1 :e http://www.technelysium.com.au/imp110d.zip
Imp-win 1.12 :W http://www.technelysium.com.au/imp112.exe
PAQ1 :W http://cs.fit.edu/~mmahoney/compression/paq1.exe
PkzipC 4.00 :W ftp://ftp.pkware.com/pkzc400s.exe
PkZip 2.50 :a ftp://ftp.simtel.net/pub/simtelnet/msdos/arcers/pk250dos.exe
QLFC 6.6W :W http://ghido.shelter.ro/Archive/DownloadQLFC.php
RK-dos 1.04.1 :e http://rksoft.virtualave.net/downloads/rk104a1d.exe
RK 1.04.1 :W http://rksoft.virtualave.net/downloads/rk104a1w.exe
SBC_d 0.969br1 :e http://personal.inet.fi/musiikki/sjm/sbc0969b_dos.zip
SBC 0.969br1 :W http://personal.inet.fi/musiikki/sjm/sbc0969b_win32.zip
Slim b13 :W http://www.slim-fb.by.ru/files/slim0013.zip
SZip 1.12a :W http://www.compressconsult.com/szip/szip_112a_win32.zip
UHArc 0.4b :eW ftp://ftp.elf.stuba.sk/pub/pc/pack/uharc04.zip
ZZip 0.36c :W http://debin.org/zzip/files/zzip-win32.zip
:a - any DOS - DOS programs, will run under pure DOS or in a DOS box
:e - extender - DOS programs using DOS extenders like DOS/4GW or CWSDPMI
:W - windows - Windows95/98/NT/etc programs
If a direct link doesn't work, most probably a newer version of the program has
appeared at the same site: visit the web page, or list the whole directory on
the FTP server (i.e. try the same URL, but without the filename).
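The "same URL without the filename" tip can be applied mechanically. The
sketch below uses Python's standard urllib.parse module; the helper name
parent_url is made up for this example.

```python
# Illustrative helper (hypothetical name): drop the filename from a download URL.
import posixpath
from urllib.parse import urlsplit, urlunsplit


def parent_url(url):
    """Return the URL of the directory that contains the linked file."""
    parts = urlsplit(url)
    directory = posixpath.dirname(parts.path)
    if not directory.endswith("/"):
        directory += "/"
    # Query string and fragment are dropped; only the directory part is kept.
    return urlunsplit((parts.scheme, parts.netloc, directory, "", ""))


print(parent_url("ftp://ftp.elf.stuba.sk/pub/pc/pack/boa058.zip"))
# ftp://ftp.elf.stuba.sk/pub/pc/pack/
```

Whether the server actually shows a directory listing at that address depends
on the site.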
Homepages:
~~~~~~~~~~
PPMD,PPMonstr,
Durilca : http://compression.ru/ds
EPM : http://compression.ru/so
YBS : http://compression.ru/ybs
BEE : http://compression.ru/fa
PPMN : http://compression.ru/ms
PPMy : http://compression.ru/sh
Eri32 : http://compression.ru/artest
mirror : http://artest1.tripod.com
7-Zip : http://www.7-zip.org
RAR,WinRAR : http://www.rarlab.com
ACE,WinACE : http://www.winace.com
PkZip : http://www.pkware.com
BZip2 : http://sources.redhat.com/bzip2
SZip : http://www.compressconsult.com/szip
ABC : http://www.data-compression.info
Arhangel : http://geocities.com/SiliconValley/Lab/6606
BA : http://hem.spray.se/mikael.lundqvist
Compressia : http://www.compressia.com
GCAC : http://emit.jp/gca/gca.html
Imp,WinImp : http://www.technelysium.com.au/winimp.html
PAQ1 : http://cs.fit.edu/~mmahoney/compression/
RK : http://rksoft.virtualave.net
SBC : http://sbcarchiver.netfirms.com
QLFC : http://ghido.shelter.ro
Slim : http://www.slim-fb.by.ru
ZZip : http://debin.org/zzip/
ShipInBottle: http://shipinbottle.narod.ru
What's new:
~~~~~~~~~~~
14 new programs were tested:
7-zip 2.30b32
RAR 3.20
ABC 1.3
ABC 2.4
UHArc 0.5np2
EPM 7
Slim b13
BEE 0.7.6
Durilca 0.1a
PPMy 3c+sse
PAQ 1
PAQ 1SSE
YBS 0.03f
Compressia 1.0b
The latest beta versions of DC, Entropy and UHArc were available
from the authors on e-mail request:
Entropy: artest@inbox.ru
DC: EdgarBinder@t-online.de
UHArc: Uwe.Herklotz@gmx.de
Results of many other programs are given only in the full version, in the TEXTS.DAT file.
The set of Russian texts is at http://arte.nm.ru/m120
WARNINGS:
~~~~~~~~~
BA 1.00beta5 can't correctly decompress shaks12.txt and the set used for speed
measurements.
DC 0.99.158b failed to decompress 1DFRE10.dc, ANDES10.dc, and BTI0110.dc,
saying "Corrupted block" (while the t(est) command reports "Test successful").
No problems were found in any of the other compressors.
ESP, Rkive and many other programs are not tested any more;
their results and links can be found in previous volumes of ARTest.
The LATEST RELEASE, and all previous volumes can be found
at http://compression.ru/artest/
Send your suggestions and comments to artest@inbox.ru
With best kind regards,
A.Ratushnyak
Last updated: 05-June-2003