dmalinovsky / kindle-hyphens Goto Github PK
View Code? Open in Web Editor NEWSoft hyphens for FB2 ebook format
Soft hyphens for FB2 ebook format
I've ran your main.py on my fb2 book and then converted it to mobi via web service. Then I've sent to kindle via email (I don't know what actual steps matter) and I don't see any hyphens.
Will need to rename the project probably then. :)
It'll make it user-friendly and greatly increase its usefulness.
А есть возможность добавить и немецкие переносы тоже?
Спасибо!
Add rest of the languages of the TeX project. Use XML for this purpose.
Решил обработать дамп либрусека - очень много ошибок типа такой:
Processing /media/storage/books/lib.rus.ec/4100.fb2... Done.
Traceback (most recent call last):
File "/home/alex/rails/utils/fb2-hyphens/main.py", line 58, in <module>
dom = parse_xml(input_file)
File "/home/alex/rails/utils/fb2-hyphens/main.py", line 15, in parse_xml
dom = parse(input_file)
File "/usr/lib/python2.7/xml/dom/minidom.py", line 1920, in parse
return expatbuilder.parse(file)
File "/usr/lib/python2.7/xml/dom/expatbuilder.py", line 924, in parse
result = builder.parseFile(fp)
File "/usr/lib/python2.7/xml/dom/expatbuilder.py", line 207, in parseFile
parser.Parse(buffer, 0)
xml.parsers.expat.ExpatError: not well-formed (invalid token): line 572, column 175
Как я понимаю, ошибки в самой книге, но ридерам же на них пофиг - можно ли сделать, чтобы скрипт не реагировал так жестоко, а пытался обработать такие файлы все равно? Файлы, обрабатывающиеся с ошибками могу выложить.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.