saarthdeshpande / book-summarizer Goto Github PK
View Code? Open in Web Editor NEWUsing pretrained T5 model for abstractive summarization of books
License: MIT License
Using pretrained T5 model for abstractive summarization of books
License: MIT License
'Corrections'
Traceback (most recent call last):
File "/content/book-summarizer/model.py", line 107, in summaryGeneration
summary = sentenceCorrection(summary)
File "/content/book-summarizer/model.py", line 43, in sentenceCorrection
sentenceDict = parser.parse(sentence)
File "/usr/local/lib/python3.7/dist-packages/gingerit/gingerit.py", line 27, in parse
return self._process_data(text, data)
File "/usr/local/lib/python3.7/dist-packages/gingerit/gingerit.py", line 39, in _process_data
for suggestion in reversed(data['Corrections']):
KeyError: 'Corrections'
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "bsCLI.py", line 20, in
pdfParser(app.config['PDF_UPLOADS'] + '/pdf_file.pdf')
File "/content/book-summarizer/preprocess.py", line 96, in pdfParser
splitChapters(filename, mailid)
File "/content/book-summarizer/preprocess.py", line 75, in splitChapters
summaryGeneration(mailid)
File "/content/book-summarizer/model.py", line 120, in summaryGeneration
send_fail(mailid)
File "/content/book-summarizer/mail.py", line 17, in send_fail
session.login(sender_address, sender_pass) # login with mail_id and password
File "/usr/lib/python3.7/smtplib.py", line 730, in login
raise last_exception
File "/usr/lib/python3.7/smtplib.py", line 721, in login
initial_response_ok=initial_response_ok)
File "/usr/lib/python3.7/smtplib.py", line 642, in auth
raise SMTPAuthenticationError(code, resp)
smtplib.SMTPAuthenticationError: (535, b'5.7.8 Username and Password not accepted. Learn more at\n5.7.8 https://support.google.com/mail/?p=BadCredentials v64sm11061838pfc.117 - gsmtp')
ImportError:
T5Tokenizer requires the SentencePiece library but it was not found in your environment. Checkout the instructions on the
installation page of its repo: https://github.com/google/sentencepiece#installation and follow the ones
that match your environment.
I have checked on the readme I didn't see anything relating to token except when you were explaining the project.
Converting PDF to txt file.
Successfully converted PDF to txt.
Total Number of Lines: 13596
No chapters in book! Writing entire book!
Done writing!
pdf_fileChapterAll.txt
Summarising: pdf_fileChapterAll.txt
Token indices sequence length is longer than the specified maximum sequence length for this model (536 > 512). Running this sequence through the model will result in indexing errors
Traceback (most recent call last):
File "/content/book-summarizer/bsCLI.py", line 19, in
copyfile(args.path, app.config['PDF_UPLOADS'] + '/pdf_file.pdf')
File "/usr/lib/python3.7/shutil.py", line 121, in copyfile
with open(dst, 'wb') as fdst:
FileNotFoundError: [Errno 2] No such file or directory: 'static/pdf/uploads/pdf_file.pdf'
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.