Softwares
Names | Stars & Forks | Issues | PRs | Downloads |
---|---|---|---|---|
pyCirclize | ||||
pyGenomeViz | ||||
pyMSAviz | ||||
ANIclustermap | ||||
COGclassifier | ||||
phyTreeViz | ||||
pybarrnap |
Genome-wide gene gain/loss mapping tool using DTL(Duplication-Transfer-Loss) reconciliation method
License: GNU General Public License v3.0
Names | Stars & Forks | Issues | PRs | Downloads |
---|---|---|---|---|
pyCirclize | ||||
pyGenomeViz | ||||
pyMSAviz | ||||
ANIclustermap | ||||
COGclassifier | ||||
phyTreeViz | ||||
pybarrnap |
IQ-TREE randomly resolves the multifurcation of the bootstrap tree (branch length = 0.0000000000).
However, it is assumed that the randomly resolved topology leads to undesirable results such as a significant increase in DTL cost.
Therefore, it may be effective to resolve multifurcation based on the topology of species tree to reduce the DTL cost generation instead of resolving random topology.
When multifurcation is detected in the bootstrap tree, I'd like to consider implementing a method to resolve it based on the topology of species tree.
Add keywords
and classifiers
FastDTLmapper does not provide a list of predicted results for horizontal gene transfer for each gene.
In order to improve the usability, we would like to consider providing the list of each horizontal gene transfer in the following TSV format.
e.g. Format example
OG_ID GENE_ID TransferPath
OG000000 Species0_GENE0 N00X -> Species0
OG000000 Species0_GENE1 SpeciesX -> Species0
AnGST.py script shabang is #!/usr/bin/python
now.
This shebang line does not work in some environments.
It should be changed to #!/usr/bin/env python2
because it may be recognized as python3.
config.py has a number of responsibilities.
It is responsible for processing arguments, storing output paths, getting parsing commands, and so on.
Therefore, I'd like to divide them to make the code more concise.
Divide class candidate
Change to recommended project structure
Add project details to pyproject.toml
for PyPI publish.
Test environment
FastDTLmapper limits the maximum number of bootstrap trees to be used in DTL reconciliation by AnGST to 100.
Since the maximum number of 100 is a Magic Number in the code, we would like to be able to control it as an optional argument.
FastDTLmapper/bin/AnGST_wrapper.py
Line 41 in e668b32
Better design.
hi,
while using fastdtlmapper i got this error
then i installed diamond
thereafter also i am getting same error, kindly look into this error
ERROR: Cannot run diamond
Format of make database command:
diamond makedb --in INPUT -d OUTPUT
ERROR: Cannot run diamond
Format of search database command:
diamond blastp -d DATABASE -q INPUT -o OUTPUT --more-sensitive -p 1 --quiet -e 0.001 --compress 1
Please check diamond is installed and that the executables are in the system path
ERROR: An error occurred, please review the error messages they may contain useful information about the problem.
0% 0:0=0s
Finished: 2022/09/22 19:28:50 (Elapsed time = 0.000[h])
0% 0:0=0s
Finished: 2022/09/22 19:28:50 (Elapsed time = 0.000[h])
0% 0:0=0s
Finished: 2022/09/22 19:28:51 (Elapsed time = 0.000[h])
0% 0:0=0s
Finished: 2022/09/22 19:28:51 (Elapsed time = 0.000[h])
0% 0:0=0s
Finished: 2022/09/22 19:28:51 (Elapsed time = 0.000[h])
Traceback (most recent call last):
File "/home/pbl2-ec/miniconda3/bin/FastDTLmapper", line 8, in
sys.exit(main())
File "/home/pbl2-ec/miniconda3/lib/python3.9/site-packages/fastdtlmapper/scripts/FastDTLmapper.py", line 57, in main
aggregate_and_map(args, outpath)
File "/home/pbl2-ec/miniconda3/lib/python3.9/site-packages/fastdtlmapper/util/time.py", line 12, in wrap
ret = func(*args, **kargs)
File "/home/pbl2-ec/miniconda3/lib/python3.9/site-packages/fastdtlmapper/scripts/FastDTLmapper.py", line 245, in aggregate_and_map
output_aggregate_map_results(outpath, group_id2all_node_event)
File "/home/pbl2-ec/miniconda3/lib/python3.9/site-packages/fastdtlmapper/scripts/FastDTLmapper.py", line 332, in output_aggregate_map_results
UtilTree.map_node_event(
File "/home/pbl2-ec/miniconda3/lib/python3.9/site-packages/fastdtlmapper/util/tree.py", line 96, in map_node_event
node_event = node_id2node_event[node.name]
KeyError: 'N001'
Gain/Loss map visualization using Seaview is enough useful for data analysis.
However, seaview gain/loss map figure is not publication-ready quality.
In order to improve the convenience of user, I'd like to add program that generate
gain/loss map figure nicely.
Formatter = black
Linter = flake8
Upper level GOterms are not significant and unuseful to plot.
Exclude top(first) and second hierarchy GOterm.
FastDTLmapper/lib/goea/goea.py
Lines 119 to 127 in e668b32
when I use the test example. I got the wrong info:
FileNotFoundError: [Errno 2] No such file or directory: 'output_minimum/02_dtl_reconciliation/OG0000000/treerecs/OG0000000_multifurcate.ufboot_recs.nwk'
How to solve this?
Input filename '.' character cause error in AnGST execution.
'.' character filename must check in input_check function.
close #45
I'd like to plot Brn/Dup/Trn info optionally.
Currently, user fasta annotation is formatted to completely unique serial id.
It is difficult to match unique serial id and original user fasta annotation id.
So, use origin fasta annotation id to unique serial id is preferable.
The GOEA results directory structure is deeply nested, making it difficult to get a full picture.
I'd like to consider outputting a summary report file that summarizes the whole picture and a directory that summarizes plot diagrams.
Current project cannot use pytest and packaging is impossible.
I'd like to resolve this problem, so change current project directory structure.
This overload is causing the PC to run fatally slow.
It looks like there is a competition for CPU resources between each process.
I'd like to solve this problem by switching to parallel processing using the "-T" option of IQ-TREE.
Remove "|" character from fasta annotation
Currently, the plot parameters in goea.py are controlled by passing them as arguments to each function.
However, as the function calls become deeper, passing arguments is becoming more complicated.
Therefore, we would like to change the way we define plot parameters to be instance variables of a class.
Dear moshi4,
I am Yuchun, an assistent professor in Sun Yat-sen University. I am trying to use FastDTLmapper for my data analysis, but an error was accured after I installed the software, even analyzed the provided example data. I have tried to install different version of FastDTLmapper, but it could not work. Could you please help me to solve this problem? Thanks
Best wishes,
Yuchun
Received email on 2022/09/03 16:16
Hi, Yuchun (@Yuchunyang2022)
Thank you for your interest in FastDTLmapper.
The reason for the error is that you do not have python2.7 installed and AnGST, the tool to run DTL reconciliation, requires python2.7.
To run FastDTLmapper successfully, you will need to build an environment where python 2.X and python 3.X coexist, such as Ubuntu 20.04, although it will take a lot of work.
To save users the trouble of building an environment, I created a Docker image for running FastDTLmapper (See here).
Although knowledge of Docker is required, it is recommended to run FastDTLmapper on this Docker image environment.
Currently, the GOEA results are plotted in gradient colors ranging from yellow to red, depending on the size of the P-value.
However, we would like to increase the number of options by adding an option to plot in user-specified colors in addition to the gradient color plots.
FastDTLmapper generates each phylogenetic tree nodes gene gain/loss information.
To clarify gain/loss genes functional trends, gain/loss GO enrichment analysis is useful.
Add below.
Refactor code with Pylance type hints check.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.