Trata PDF para torná-lo compatível com PDF/X e com impressoras em escala de cinza.

Last update: Nov 30, 2021

Related tags

PDF Files Processing tratapdf

Overview

tratapdf

Trata PDF para torná-lo compatível com PDF/X e com impressoras em escala de cinza.

dependências

icc-profiles
ghostscript
visualizador de PDF (nesta versão usamos o evince hardcoded)

instalação no Debian 11

clonar o repositório;
configurar as variáveis:
- BASE_DIR: diretório onde ficarão os recursos;
- GHOSTSCRIPT: caminho para o executável;
- PDF_VIEWER: caminho para o executável;
copiar os arquivos abaixo para o BASE_DIR:
- /usr/share/ghostscript/9.53.3/lib/PDFX_def.ps;
- /usr/share/color/icc/ISOuncoated.icc.
editar o PDFX_def.ps, substituindo o que está entre () na linha começada por /ICCProfile para o caminho para o ISOuncoated.icc recém copiado. Exemplo: /ICCProfile (/tmp/tratapdf/ISOuncoated.icc) def.

como rodar?

Executar python3. Outra opção é criar um launcher para o programa. Nesse caso é recomendável usar o path completo do Python bem como marcar o launcher como executável.

Será criado um arquivo com o mesmo nome do tratado, mas com o sufixo -tratado.

Trata PDF para torná-lo compatível com PDF/X e com impressoras em escala de cinza.

Related tags

Overview

tratapdf

dependências

instalação no Debian 11

como rodar?

Owner

Table automatically extraction from PDF Document

Camelot is a Python library that can help you extract tables from PDFs!

borb is a library for reading, creating and manipulating PDF files in python.

Generate a preview image for a PDF.

Convert given source code into .pdf with syntax highlighting and more features

WeasyPrint is a smart solution helping web developers to create PDF documents.

Simple python tool created for downloading PDF.

An application which enables the users to perform simple yet intriguing PDF operations

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

PDFSanitizer - Renders possibly unsafe PDF files and outputs harmless PDF files

A python library for extracting text from PDFs without losing the formatting of the PDF content.

Performing the following operations using python on PDF.

PyMuPDF is a Python binding with support for MuPDF

pikepdf is a Python library for reading and writing PDF files.

Extract the table in the PDF，outputs the data similar to the json format

Camelot is a Python library that makes it easy for anyone to extract tables from PDF files

Convert PDF to AudioBook and Audio Speech to PDF

Python PDF Parser (Not actively maintained). Check out pdfminer.six.

Scans pdfs for links written in plaintext and checks if they are active or returns an error code.

Simple HTML and PDF document generator for Python - with built-in support for popular data analysis and plotting libraries.