Giter Site home page Giter Site logo

biocyberman / bcbio-nextgen Goto Github PK

View Code? Open in Web Editor NEW

This project forked from bcbio/bcbio-nextgen

0.0 0.0 0.0 35.45 MB

Best-practice pipelines for fully automated high throughput sequencing analysis

Home Page: https://bcbio-nextgen.readthedocs.org

License: MIT License

Shell 0.64% Python 99.18% Ruby 0.03% Lua 0.15%

bcbio-nextgen's People

Contributors

a113n avatar alexcoman avatar apastore avatar biocyberman avatar brainstorm avatar brentp avatar cbrueffer avatar chapmanb avatar guillermo-carrasco avatar hammer avatar kern3020 avatar lbeltrame avatar lpantano avatar mariogiov avatar matted avatar matthdsm avatar mistrm82 avatar mjafin avatar pdiakumis avatar peterjc avatar pfaucon avatar porterjamesj avatar roryk avatar saketkc avatar stephenturner avatar tanglingfung avatar tetianakh avatar travc avatar vals avatar vladsavelyev avatar

Watchers

 avatar  avatar

bcbio-nextgen's Issues

how to dynamically locate ericscript resources based on genome build?

@chapmanb Brad and @tetianakh Tetiana

Need your help to pass test cases of ericscript. The part of dynamic locating of resource I don't understand very much yet.

I already download ericscript data and have them saved under:
/<bcbioinstalldir>/genomes/Hsapiens/hg38/rnaseq/ericscript/ericscript_db_homosapiens_ensembl84/data/homo_sapiens
and
/<bcbioinstalldir>/genomes/Hsapiens/hg19/rnaseq/ericscript/ericscript_db_homosapiens_ensembl73/data/homo_sapiens

However, the pytest failed, and I think because of incorrect data resources setup, and self._db_location is None.

I have this in the end of /<bcbio_installationdir>/galaxy/bcbio_system.yaml

 ericscipt:
    db: /<bcbioinstalldir>/genomes/Hsapiens/hg19/rnaseq/ericscript/ericscript_db_homosapiens_ensembl73
    env: /<bcbioinstalldir>/anaconda/envs/samtools0

The following is the test output. The code is from ericscript branch. In case you don't know already :): You need to clone, build and install this branch to test it.

pytest ./integration/rnaseq/test_ericscript.py

========================================================================================================================================================================== test session starts ===========================================================================================================================================================================
platform linux2 -- Python 2.7.14, pytest-3.2.3, py-1.4.34, pluggy-0.4.0
rootdir: /home/user/github/bcbio-nextgen, inifile:
plugins: mock-1.6.3, cov-2.5.1
collected 2 items

integration/rnaseq/test_ericscript.py FF

================================================================================================================================================================================ FAILURES ================================================================================================================================================================================
________________________________________________________________________________________________________________________________________________________ test_detect_fusions_with_ericscipt_without_disambiguate _________________________________________________________________________________________________________________________________________________________

install_test_files = None, data_dir = '/home/user/github/bcbio-nextgen/tests/data/automated', setup_logging = None

    @pytest.mark.install_required
    def test_detect_fusions_with_ericscipt_without_disambiguate(
            install_test_files, data_dir, setup_logging):
        """Run gene fusion analysis on trimmed pair-end reads with EricScript.
           Requires installation of EricScript and its reference data.
        """
        with make_workdir() as work_dir:
            sample_config = create_sample_config(
                data_dir, work_dir, disambiguate=False)
>           ericscript.run(sample_config)

integration/rnaseq/test_ericscript.py:37:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
../bcbio/rnaseq/ericscript.py:27: in run
    run_ericscript(config, input_files)
../bcbio/rnaseq/ericscript.py:53: in run_ericscript
    build_bwa_index_if_absent(es_config, sample_config)
../bcbio/rnaseq/ericscript.py:62: in build_bwa_index_if_absent
    if not os.path.exists(es_config.reference_index):
../bcbio/rnaseq/ericscript.py:146: in reference_index
    return os.path.join(self._db_location, self._REF_INDEX)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

a = None, p = ('data/homo_sapiens/allseq.fa.bwt',), path = None, b = 'data/homo_sapiens/allseq.fa.bwt'

    def join(a, *p):
        """Join two or more pathname components, inserting '/' as needed.
        If any component is an absolute path, all previous path components
        will be discarded.  An empty last part will result in a path that
        ends with a separator."""
        path = a
        for b in p:
            if b.startswith('/'):
                path = b
>           elif path == '' or path.endswith('/'):
E           AttributeError: 'NoneType' object has no attribute 'endswith'

/bcbio/anaconda/lib/python2.7/posixpath.py:70: AttributeError
__________________________________________________________________________________________________________________________________________________________ test_detect_fusions_with_ericscipt_with_disambiguate __________________________________________________________________________________________________________________________________________________________

install_test_files = None, data_dir = '/home/user/github/bcbio-nextgen/tests/data/automated', setup_logging = None

    @pytest.mark.install_required
    def test_detect_fusions_with_ericscipt_with_disambiguate(
            install_test_files, data_dir, setup_logging):
        """Run gene fusion analysis on disambiguated reads with EricScript.
           Requires installation of EricScript and its reference data.
        """
        with make_workdir() as work_dir:
            sample_config = create_sample_config(
                data_dir, work_dir, disambiguate=True)
>           ericscript.run(sample_config)

integration/rnaseq/test_ericscript.py:50:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
../bcbio/rnaseq/ericscript.py:27: in run
    run_ericscript(config, input_files)
../bcbio/rnaseq/ericscript.py:53: in run_ericscript
    build_bwa_index_if_absent(es_config, sample_config)
../bcbio/rnaseq/ericscript.py:62: in build_bwa_index_if_absent
    if not os.path.exists(es_config.reference_index):
../bcbio/rnaseq/ericscript.py:146: in reference_index
    return os.path.join(self._db_location, self._REF_INDEX)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

a = None, p = ('data/homo_sapiens/allseq.fa.bwt',), path = None, b = 'data/homo_sapiens/allseq.fa.bwt'

    def join(a, *p):
        """Join two or more pathname components, inserting '/' as needed.
        If any component is an absolute path, all previous path components
        will be discarded.  An empty last part will result in a path that
        ends with a separator."""
        path = a
        for b in p:
            if b.startswith('/'):
                path = b
>           elif path == '' or path.endswith('/'):
E           AttributeError: 'NoneType' object has no attribute 'endswith'

/bcbio/anaconda/lib/python2.7/posixpath.py:70: AttributeError
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Captured stderr call --------------------------------------------------------------------------------------------------------------------------------------------------------------------------
[2017-11-02T17:00Z] INFO: Converting disambiguated reads to fastq...
[2017-11-02T17:00Z] DEBUG: Picard: SamToFastq
[2017-11-02T17:00Z] DEBUG: unset JAVA_HOME && export PATH=/bcbio/anaconda/bin:$PATH && picard -Xms750m -Xmx2g -XX:+UseSerialGC SamToFastq INPUT=/home/user/github/bcbio-nextgen/tests/data/automated/../fusion/input/Test1.nsorted.human.sorted.bam FASTQ=/home/user/github/bcbio-nextgen/tests/test_automated_output/
bcbiotx/tmpxwz8sv/Test1.nsorted.human.sorted_1.fastq TMP_DIR=/home/user/github/bcbio-nextgen/tests/test_automated_output/bcbiotx/tmpFycsXy SECOND_END_FASTQ=/home/user/github/bcbio-nextgen/tests/test_automated_output/fastq_convert/Test1.nsorted.human.sorted_2.fastq VALIDATION_STRINGENCY=SILENT
[2017-11-02T17:00Z] DEBUG: 18:00:20.482 INFO  NativeLibraryLoader - Loading libgkl_compression.so from jar:file:/bcbio/anaconda/share/picard-2.14-0/picard.jar!/com/intel/gkl/native/libgkl_compression.so
[2017-11-02T17:00Z] DEBUG: [Thu Nov 02 18:00:20 CET 2017] SamToFastq INPUT=/home/user/github/bcbio-nextgen/tests/data/automated/../fusion/input/Test1.nsorted.human.sorted.bam FASTQ=/home/user/github/bcbio-nextgen/tests/test_automated_output/bcbiotx/tmpxwz8sv/Test1.nsorted.human.sorted_1.fastq SECOND_END_FASTQ=/home/user/github/bcbio-nextgen/tests/test_automated_output/fastq_convert/Test1.nsorted.human.sorted_2.fastq TMP_DIR=[/home/user/github/bcbio-nextgen/tests/test_automated_output/bcbiotx/tmpFycsXy] VALIDATION_STRINGENCY=SILENT    OUTPUT_PER_RG=false COMPRESS_OUTPUTS_PER_RG=false RG_TAG=PU RE_REVERSE=true INTERLEAVE=false INCLUDE_NON_PF_
READS=false CLIPPING_MIN_LENGTH=0 READ1_TRIM=0 READ2_TRIM=0 INCLUDE_NON_PRIMARY_ALIGNMENTS=false VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false GA4GH_CLIENT_SECRETS=client_secrets.json USE_JDK_DEFLATER=false USE_JDK_INFLATER=false
[2017-11-02T17:00Z] DEBUG: [Thu Nov 02 18:00:20 CET 2017] Executing as vql@cybertron on Linux 4.4.0-97-generic amd64; OpenJDK 64-Bit Server VM 1.8.0_121-b15; Deflater: Intel; Inflater: Intel; Picard version: 2.14-SNAPSHOT
[2017-11-02T17:00Z] DEBUG: [Thu Nov 02 18:00:22 CET 2017] picard.sam.SamToFastq done. Elapsed time: 0.03 minutes.
[2017-11-02T17:00Z] DEBUG: Runtime.totalMemory()=760217600
======================================================================================================================================================================== 2 failed in 4.91 seconds ========================================================================================================================================================================

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.