고정 Dictionary 기반 압축

LZ77 알고리즘은 search buffer, lookahead buffer로 구분하고, 뒤에 있는 버퍼가 앞에 있는 버퍼에서 매칭되는 부분을 축약하는 형태의 압축입니다. 결국 뒤 버퍼는 앞 버퍼를 마치 사전처럼 사용합니다. 따라서 LZ77 알고리즘을 Dictionary Compression Method를 사용하는 알고리즘이라고 할 수 있습니다.

만약 입력되는 데이터에 대한 정보를 알고 있다면 search buffer를 사용자가 제공하는 버퍼를 사용하도록 할 수 있습니다. 따라서 search buffer를 최소화하고, 성능이 빠른 압축이 가능해집니다.

deflateSetDictionary 함수와 inflateSetDictionary 함수를 사용해서 Dictionary Buffer를 지정할 수 있습니다.

동영상 강좌 예제 코딩
// Sample3_Dict.cpp : Defines the entry point for the console application.
//

#include "stdafx.h"

#include <LibZ/zlib.h>
#include <iostream>

int _tmain(int argc, _TCHAR* argv[])
{
   const Bytef dic[]="HELLO";
   const int BUF = 4096;

   //압축시 데이터와 데이터 길이
   Bytef deflate_out[BUF];
   int deflate_size(0);

   //압축 해제 데이터와 데이터 길이
   Bytef inflate_out[BUF];
   int inflate_size(0);

   // 압축 계산.
   do{
      z_stream stream;
       stream.zalloc = Z_NULL;
       stream.zfree = Z_NULL;
       stream.opaque = Z_NULL;
      int ret = deflateInit(&stream, Z_DEFAULT_COMPRESSION);
      if( Z_OK!=ret) break;

       deflateSetDictionary(&stream, dic, strlen( (const char*)dic ) );

      Bytef in[] ="HELLOaHELLObHELLOaHELLObHELLOaHELLObHELLOaHELLObHELLOaHELLObHELLOaHELLOb";
       std::cout<<"Raw Data Size:"<<strlen( (const char*)in)<<std::endl;

       stream.next_in = in;
       stream.avail_in = strlen( (const char*)in);

       stream.next_out = deflate_out;
       stream.avail_out = BUF;

      do {
          deflate(&stream,Z_FINISH);
      } while( 0 != stream.avail_in);

       deflate_size = BUF - stream.avail_out;
       std::cout<< "Deflate Size:"<<deflate_size<<std::endl;

       deflateEnd(&stream);

   }while(false);

   /// 압축 해제
   do {
      z_stream stream;
       stream.zalloc = Z_NULL;
       stream.zfree = Z_NULL;
       stream.opaque = Z_NULL;
      int ret = inflateInit(&stream);
      if( Z_OK!=ret) break;

      //입력 데이터
       stream.next_in = deflate_out;
       stream.avail_in = deflate_size;

      //출력 버퍼
       stream.next_out = inflate_out;
       stream.avail_out = BUF;

      do {
          ret = inflate(&stream, Z_NO_FLUSH);
         if (Z_NEED_DICT == ret){
             inflateSetDictionary(&stream, dic, strlen( (const char*) dic ) );
         }

      } while(0!=stream.avail_in);

       inflate_size = BUF - stream.avail_out;
       std::cout<<"Inflate size:"<<inflate_size<<std::endl;

       inflate_out[inflate_size]=NULL;
       std::cout<<"Inflate Data:"<<(const char*) inflate_out<<std::endl;

       inflateEnd(&stream);
   }while(false);

    system("pause");

   return 0;
}

zlib 개요 zlib msvc로 포팅하기 1 zlib msvc로 포팅하기 2 Block과 Flush Deflate 샘플 Inflate 샘플 고정 Dictionary 기반 압축 Block 단위 압축 및 해제 zalloc, zfree, opaque 사용 예 compress,uncompress 함수 사용하기 간단히 zip 파일 생성하기 간단히 zip 파일 해제하기 zip 파일 포맷 간단히 알아보기 폴더 압축해보기 압축 파일 폴더에 풀기 Self Extractor 만들기 zip IO 핸들링 암호걸린 zip 생성 및 해제 zip 파일에서 특정 파일만 지우기 gz 파일 사용하기

13 thoughts on “gz 파일 사용하기”

DENIS DOS SANTOS SILVA says:

2016/11/11 at 5:56 am

thanks you very!

Paul Kim says:

2018/07/04 at 10:38 am

안녕하세요 좋은 영상과 글을 잘 봤습니다.
글에 보면 “입력 버퍼와 출력 버퍼를 고정으로한 압축 함수”에 대해서 어떤 함수를 봐야 하는지 알수 있을까요?
좋은 글 감사드리며, 오래된 글에 문의를 남깁니다.
감사합니다.

- admin says:
  
  2018/07/24 at 2:35 pm
  
  [compress,uncompress 함수 사용하기] 동영상입니다.
  
iskra says:

2018/09/14 at 1:42 am

동영상 잘 보았습니다. 궁금한것이 있는데요, Z_SYNC_FLUSH 를 사용할 경우에, 만약 next_out buffer 크기가 압축한 data 를 담기에 충분하지 않는다면, 어떻게 해야 하는지요?

- admin says:
  
  2018/10/25 at 2:31 am
  
  Z_SYNC_FLUSH는 쓰고 있는 비트를 모두 바이트 단위로 쓰라는 옵션임으로 일반적인 쓰기 과정에서 발생한 에러 코드를 반환하게 되어 있습니다. [Deflate 샘플] 동영상을 참고하세요.
  
hinata says:

2019/08/16 at 5:15 am

C로 압축하고 c#으로 압축을 푸는 방법이 있을까요?
텍스트 파일을 작성시

gzwritre(파일 포인터, 2019, sizeof(int));
gzwritre(파일 포인터, 09, sizeof(int));
gzwritre(파일 포인터, 17, sizeof(int));

gzwritre(파일 포인터, 0.1, sizeof(float)); 결과값
gzwritre(파일 포인터, 0.2, sizeof(float));
gzwritre(파일 포인터, 0.3, sizeof(float));
…
gzwritre(파일 포인터, 100.0, sizeof(float));

이렇게 텍스트 파일을 저장(압축)했는데 C#에서 이 파일을 읽어서

연 / 월 / 일 을 정수 변수에
결과값을 실수배열에 저장하고 싶습니다.

- admin says:
  
  2019/08/16 at 6:48 am
  
  C#으로 포팅된 zip라이브러리를 사용하시면 문제없이 될겁니다. 예를 들어 SharpZipLib과 같은~
  http://icsharpcode.github.io/SharpZipLib/
  https://github.com/icsharpcode/SharpZipLib/wiki/Zip-Samples
  https://icsharpcode.github.io/SharpZipLib/help/api/ICSharpCode.SharpZipLib.GZip.GZip.html
  
rust says:

2019/11/26 at 3:19 am

분할 압축파일일 경우 zilb로 압축 해제가 가능 한가요?
아니면 다른 라이브러리를 사용해야 할까요 ?

- admin says:
  
  2020/02/28 at 12:07 pm
  
  별도로 지원하는 함수는 없지만 그냥 저장하는 과정에서 파일만 나눠 저장하는 것이 아닐지요.
  
hyunyoung_eom says:

2020/11/02 at 10:42 am

안녕하세요
c++에서 zlib uncompress로 .emf image 파일을 압축해제 하려고 합니다.
그런데 손상된 파일이을 대상으로 진행할 경우 Z_DATA_ERROR을 반환하면서 자동종료됩니다.
압축된 파일이 Z_DATA_ERROR를 반환하면(손상된 파일이면) 빈 .emf image 파일로 압축해제 되도록 하려면 어떻게 해야할까요?

- admin says:
  
  2020/11/21 at 2:54 pm
  
  Z_DATA_ERROR이면 그냥 빈.emf image 파일을 생성해주면 되는 것 아닌가요?
  
yessare says:

2021/03/14 at 3:02 am

리눅스에서 C언어를 사용해서 zlib으로 위와 같이 폴더(디렉토리)를 압축할수있나요?

- admin says:
  
  2021/05/04 at 7:52 am
  
  libzip에 포함된 contrib/minizip를 함께 사용하시면 가능합니다.