Antenati Book Downloader (es)

La finalidad de este programa es descargar libros de actas de la página web Antenati. Todas las páginas del libro se almacenan en una carpeta nueva, en la mayor resolución posible. De esta manera, podrás buscar actas más rápido usando la aplicación de fotos de tu PC, en vez del visualizador de Antenati, que es lento e incómodo.

Ya que todas las imágenes están en alta resolución, el script tardará un tiempo en completarse. Un libro de 800 páginas puede tardar hasta una hora en descargarse y ocupará aproximadamente 1 GB de almacenamiento.

Deberás tener Python instalado. El notebook utiliza los siguientes módulos; solo requests debe instalarse con pip install, los demás forman parte de la biblioteca estándar de Python:

requests
shutil
pathlib
json
glob

Asimismo, deberás usar Jupyter para abrir el notebook: https://jupyter.org/install

Para descargar un libro, sigue las instrucciones:

Localiza el IIIF manifest en la sección 'Contenuti collegati' del visualizador del libro.

Haz clic derecho en el link y selecciona 'Guardar link como'. Asegúrate de guardar el archivo JSON en el directorio donde se encuentra el Jupyter Notebook.
Descarga tantos manifests como desees. Cuando ejecutes el Jupyter Notebook, descargará todos los libros cuyos manifests estén en ese directorio.
Cada libro se descargará en su propia carpeta. Asegúrate de eliminar el archivo JSON de un libro después de descargarlo, para evitar que se vuelva a descargar la próxima vez que ejecutes el script.
Podrás descargar libros individualmente corriendo la función 'downloadBook' por separado.

Antenati Book Downloader (en)

The purpose of this script is to download record books from the Antenati website. All the pages of a book are stored in a newly created folder, at the highest resolution available. This way, you can look for records faster with your PC's photo application instead of the Antenati web viewer, which is slow and awkward to use.

Keep in mind that, since all the images are at high resolution, the script will take a while to complete. An 800-page book may take up to one hour to download and will occupy approximately 1 GB of storage.
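As a rough guide for books of other sizes, the figures above can be scaled by page count. The sketch below assumes download time and size grow linearly with the number of pages, which is a simplification; `estimate_download` is a hypothetical helper and not part of the notebook.

```python
def estimate_download(pages: int) -> tuple[float, float]:
    """Return (minutes, megabytes) scaled from the README's 800-page example.

    Reference point: 800 pages take up to 1 hour and occupy about 1 GB,
    so the time returned here is an upper-bound estimate.
    """
    seconds_per_page = 3600 / 800   # 4.5 s per page at most
    megabytes_per_page = 1000 / 800  # about 1.25 MB per page
    minutes = pages * seconds_per_page / 60
    megabytes = pages * megabytes_per_page
    return minutes, megabytes


print(estimate_download(200))  # (15.0, 250.0)
```

Actual times depend on your connection and on how quickly the server responds.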

You must have Python installed. The notebook uses the following modules; only requests has to be installed with pip install, the rest are part of the Python standard library:

requests
shutil
pathlib
json
glob
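Before opening the notebook, you can check that every module in the list is importable. This is a minimal sketch; `missing_modules` is a hypothetical helper, not something the notebook provides. Of the modules listed, only `requests` is third-party, so it is the only one expected to be reported as missing on a fresh Python installation.

```python
import importlib.util

# Module names taken from the list above.
REQUIRED = ["requests", "shutil", "pathlib", "json", "glob"]


def missing_modules(names):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Install with pip:", " ".join(missing))
    else:
        print("All required modules are available.")
```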

Additionally, you must use Jupyter to open the notebook: https://jupyter.org/install

To download a book, follow these instructions:

Locate the IIIF manifest in the 'Contenuti collegati' section of the registry book viewer.

Right-click the link and select 'Save link as'. Make sure you save the JSON file in the directory where the Jupyter notebook is located.
Download as many manifests as you like. When you run the Jupyter notebook, it will download every book whose manifest is in that directory.
Each book is downloaded into its own folder. Remove a book's JSON file once the download has finished so that it is not downloaded again the next time you run the script.
You can download individual books by running the 'downloadBook' function separately.
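The workflow described in these steps can be sketched roughly as follows. This is not the notebook's actual code: `image_urls_from_manifest`, `download_book_sketch` and `download_all` are hypothetical names, the output file naming is invented for illustration, and the manifest layout is assumed to follow IIIF Presentation 2.x with images served through the IIIF Image API 2.x. If the saved manifest uses a different structure, the key lookups will need adjusting.

```python
import json
import shutil
from pathlib import Path


def image_urls_from_manifest(manifest: dict) -> list[str]:
    """Return one full-resolution image URL per page (canvas).

    Assumes the IIIF Presentation 2.x layout:
    sequences -> canvases -> images -> resource -> service -> "@id".
    """
    urls = []
    for canvas in manifest["sequences"][0]["canvases"]:
        service_id = canvas["images"][0]["resource"]["service"]["@id"]
        # IIIF Image API 2.x URL pattern:
        # {id}/{region}/{size}/{rotation}/{quality}.{format}
        urls.append(f"{service_id.rstrip('/')}/full/full/0/default.jpg")
    return urls


def download_book_sketch(manifest_path: Path) -> Path:
    """Hypothetical stand-in for the notebook's downloadBook function."""
    # Third-party dependency, imported here so the parsing helper above
    # can be used even when requests is not installed.
    import requests

    manifest = json.loads(manifest_path.read_text(encoding="utf-8"))
    target = manifest_path.with_suffix("")  # folder named after the JSON file
    target.mkdir(exist_ok=True)
    for page, url in enumerate(image_urls_from_manifest(manifest), start=1):
        with requests.get(url, stream=True, timeout=60) as response:
            response.raise_for_status()
            response.raw.decode_content = True  # undo gzip/deflate if present
            with open(target / f"{page:04d}.jpg", "wb") as out_file:
                shutil.copyfileobj(response.raw, out_file)
    return target


def download_all(directory: Path = Path(".")) -> None:
    """Download every book whose manifest (*.json) sits in `directory`."""
    for manifest_path in sorted(directory.glob("*.json")):
        download_book_sketch(manifest_path)
```

Calling `download_all()` from the notebook's directory would process every saved manifest, while `download_book_sketch(Path("some-book.json"))` handles a single book. Pages are numbered in manifest order so that they sort correctly in a photo viewer.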

Repository: gcufaro/antenati-book-downloader

NOTA: Lamentablemente, los archivos IIIF de cada libro son privados, por lo que esta solución no se puede utilizar. Sin embargo, se pueden bajar imágenes individuales manualmente.

NOTE: Unfortunately, IIIF files of each book are private, so this solution cannot be used. However, you can still download individual images manually.
