Comments (22)
also I got just two commands after completing bitextor-train-document-alignment && bitextor-get-html-text only.
from bitextor.
from bitextor.
@iyadahmad I also have problems with bitextor & Ubuntu 16.04, I now use Ubuntu 18.04. You may want to upgrade, or give more detailed descriptions so that the issue can be debugged
from bitextor.
from bitextor.
yes the master branch , btw I've tried with ubuntu 18.04 and the result the same , I will attach log files.
from bitextor.
I followed the instructions in this files https://github.com/bitextor/bitextor/blob/master/README.md
autogen.log
config.log
makeinstalllog.log
makelog.log
from bitextor.
CMakeError-kenlm.log
CMakeError-mgiza.log
CMakeError-preprocess.log
CMakeOutput-kenlm.log
CMakeOutput-mgiza.log
CMakeOutput-preprocess.log
from bitextor.
Looks like some parts are missing from boost or something is broken with libboost or cmake versions, maybe they are too old. This are the most important errors here (looking at mgiza CMakeError log file):
error: ‘std::tr1’ has not been declared
...
CheckSymbolExists.c:(.text+0x1b): undefined reference to `pthread_create'
...
/usr/bin/ld: cannot find -lpthreads
Check if libboost-all-dev
installed
from bitextor.
thanks for your quick reply ,
libboost-all-dev already installed and the version is 1.65.1
cmake version 3.10.2
from bitextor.
Yesterday , I try to install the latest version of bitextor. There is no error message in the installing process, but in the end, I can't use the "bitextor" command to crawl data.
The installing process described as following:
run the script 'configure'
./autogen.sh
make
sudo make install
no error message but can't run bitextor command, I am sure that there is no PATH problem.
from bitextor.
no error message but can't run bitextor command.
I install to a specific path ,and find that only has the command of the following graph:
from bitextor.
from bitextor.
Forget my previous message. There is an issue in master
branch which is fixed in our development branch. Please, use the released tag Bitextor 6 package meanwhile we fix it: https://github.com/bitextor/bitextor/releases/tag/v6
from bitextor.
Thanks for your reply, lpla.
The above bitextor link is Bitextor 6, and the following links is the Bitextor 7? And I also want to konw when I can formally use it? Thanks.
https://github.com/bitextor/bitextor/
from bitextor.
from bitextor.
Ok, now master is fixed, as @mespla said.
Some notes. Bitextor 7 is the next stable release of Bitextor, which is now in development. We manage our code similarly as Debian does:
tags
are for stable releases that have support until next stable comes, which is Bitextor 6 series (well, 6.0.1 now thanks to your issue with the '\s' insed
, so thanks!). That's why I recommended to use it until we fixed master.master
is our testing (Debian equivalent) in which we integrate all the new features we finished (or reached a milestone) for next stable release, like full Python 3 compatibility, WARC format support and more. It can be broken sometimes (today, for example, but now it is working again).- Rest of
branches
are our development playground for new features and they will be probably not even documented in the Wiki or README.md and, of course, prone to be broken.
So, please, @iyadahmad and @JiaYueHuang , test now your installation pulling updates for master and report any other issue. In case you find more issues and we take long to answer, please, try with Bitextor 6.x series.
from bitextor.
Thanks~
from bitextor.
Hello ,
Master , still "bitextor" binary doesn't exist just [/usr/local/bin/bitextor-get-html-text
and /usr/local/bin/bitextor-train-document-alignment]
CMakeError-kenlm.log
CMakeError-mgiza.log
CMakeError-preprocess.log
CMakeOutputkenlm.log
CMakeOutput-mgiza.log
CMakeOutput-preprcess.log
from bitextor.
i will test v6.x tag and will write my feedback .
from bitextor.
V6.x notes
when run ./autogen.sh >> configure: error: this package requires the following libraries for Python:
- keras: Please, install v1.0.3 or later and try again: sudo pip install keras
but keras is already installed with pip3 [Keras (2.2.4)]
here are the used steps from my side :
- git submodule update --init --recursive
- sudo apt install cmake g++ automake pkg-config openjdk-8-jdk python3 python3-pip python3-magic libboost-all-dev maven libbz2-dev liblzma-dev zlib1g-dev libffi-dev
- sudo pip3 install -e git://github.com/mammadori/magic-python.git#egg=Magic_file_extensions
- sudo pip3 install --upgrade python-Levenshtein tensorflow keras iso-639 langid nltk regex h5py ftfy warc3-wet
- sudo apt install httrack
- sudo pip3 install -r bicleaner/requirements.txt
- sudo pip3 install --upgrade matplotlib sklearn numpy && sudo apt install python3-tk
- ./autogen.sh
- make
- sudo make install
from bitextor.
from bitextor.
Thanks a lot, it works now .
from bitextor.
Related Issues (20)
- Error in rule bicleaner HOT 7
- Install Alcazar HOT 3
- Process completes without error but does not produce any sentence pairs HOT 1
- Urdu sentence alignment HOT 7
- Inconsistent behaviour of paths in .yaml file HOT 1
- How do you compare two different domains HOT 2
- Problem when run bitextor using document aligner NMT HOT 6
- Document aligner happily returns nothing with piped input HOT 1
- Custom Word Tokenizer Error HOT 3
- CMake build failed v8.1.1 HOT 6
- Bitextor crashes if Bicleaner filters all lines
- Hunalign and Bicleaner errors HOT 3
- Bleualign error HOT 4
- custom_translate getting called without externalMT HOT 2
- External embeddings
- Instruction on running bitextor_align_segments.py for Hunalign only? HOT 1
- Only first file in warc file appears to be processed when "directories" is used as data source HOT 4
- New Bicleaner AI full models HOT 1
- Document level granularity of Paracrawl HOT 1
- Bitextor usage HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bitextor.