Compression
Size of…
Original Text: 5.32 MB
Zip File: 1.84 MB
.7z: 1.50 MB
.BZ2: 1.41MB
.7z Ultra: 1.49 MB
Best Compression: .BZ2
Bzip2 is a free and open source lossless data compression algorithm and program. Bzip2 compresses most files more effectively than the older .zip compression algorithms, but is considerably slower. Bzip2 is only a data compressor. It is not an archiver like ZIP; the program itself has no facilities for multiple files, encryption or archive-splitting, but, in the UNIX tradition, relies instead on separate external utilities for these tasks. Bzip2 uses the Burrows-Wheeler transform to convert frequently-recurring character sequences into strings of identical letters, applies a move-to-front transform, and finally uses Huffman coding. Bzip2 is known to be quite slow at compressing, but this problem is asymmetric, as decompression is relatively fast. The ZIP file format is a data compression and archive format. A ZIP file contains one or more files that have been compressed to reduce file size, or stored as-is. ZIP is a simple archive format that compresses every file separately. Compressing files separately allows for individual files to be retrieved without reading through other data; in theory, it may allow better compression by using different algorithms for different files. A caveat to this is that archives containing a large number of small files end up significantly larger than if they were compressed as a single file. 7z is a compressed archive file format that supports several different data compression, encryption and pre-processing filters. The 7z format provides the following main features: open, modular architecture which allows any compression, conversion, or encryption method to be stacked, high compression ratios (depending on the compression method used), strong Rijndael/AES-256 encryption, large file support (up to approximately 16 exabytes), unicode file names, support for solid compression, where multiple files of like type are compressed within a single stream, in order to exploit the combined redundancy inherent in similar files, and compression and encryption of archive headers. The format’s open architecture allows additional future compression methods to be added to the standard. The 7z format does not store UNIX owner/group permissions, and hence can be inappropriate for backup/archival purposes.
There are situations in which one type of compression method would be superior to others. Bzip2 is the most effective compression method, but it is slower. Therefore, if time is an issue a person might opt to use a zip compression method. If size were the issue, however, bzip2 would be the best compression choice. The zip format compresses each file separately. If a person has several files that might need to be retrieved separately, this would be the best choice. Also, this is the best choice for archiving. However, this would not be the best choice if size were an issue. 7z would be the best choice if a person wants encryption abilities while using a small amount of space. However, this format should not be used for backup or archiving purposes.




