A small tool that converting tables in a pdf file to a single csv file.
A small tool that scrapes all available tables in an input pdf file, then merging them into a single dataframe and save it as a csv file.
- Tabula-py
- Pandas
- Run the program on a Command Line Interface (CLI):
#!/bin/bash
python scraper.py -p <path_to_the_input_pdf_file>