Comments (3)
Secondary Issue related to HPC Snakemake Upgrade #371
Dry-running the second half of the ChIP-seq pipeline causes a python ImportError
. This error is related to recent changes HPC has made to the python/3.5
module on Biowulf (see issue #371 for more information).
- TLDR: HPC upgraded the version of snakemake that is bundled with
module load python/3.5
. It was changed from snakemake version 5.1.3 to snakemake version 5.3.0. There are a few incompatibility issues with these two versions of snakemake due to changes in command-line arguements. To overcome these issues, snakemake/5.1.3 is loaded after module loading python 3.5.
The import error is due to the version of python that is bundled with snakemake (see: /usr/local/Anaconda/envs_app/snakemake/5.1.3/bin/
). This version of python only includes the standard libraries that come standard with python 3.6, and does not include pysam.
from pipeliner.
Update
After closer inspection, it looks that pysam is not actively being used. The function call to normalize_bam_file_chromosomes
, which was using pysam, has been commented out. For the time being, I will move the import statement inside of the function normalize_bam_file_chromosomes
.
def normalize_bam_file_chromosomes(bamfns, obamfns=[], suffix='.common_chrom.bam'):
from pysam import Samfile, FastaFile
counts = []
for bamfn1 in bamfns :
bam1 = Samfile(bamfn1)
cnt1 = Counter()
for aread1 in bam1 :
if aread1.is_unmapped :
continue
cnt1[aread1.reference_name] += 1
bam1.close()
counts.append(cnt1)
common = None
for cnt in counts :
if common != None :
common = common.intersection(cnt)
else :
common = set(cnt)
Changes will be made in the following locations:
- module load ccbrpipeliner/3.0
- master branch
- activeDev branch
Permanent solution:
Remove the snakemake run
for the rule PePr, and make a script. This will allow us to submit a job with the exact version of python that should be used.
from pipeliner.
Resolved
from pipeliner.
Related Issues (20)
- Trimming adapters HOT 1
- Fix error with PE duplicate removal. Incorporate Vishal's code.
- Calculate actual mean fragment length for PE data. Use for sicer PE.
- Improvement to differential expression html reports
- Move to STAR version >= 2.7.6a HOT 1
- Parse GTF on the fly HOT 1
- Variant Calling for RNAseq (hg38) stopped at star_align_2 HOT 15
- ChildIOException: File/directory is a child to another output HOT 3
- miRseq pipeline HOT 4
- Custom Reference Genome and GTF file HOT 3
- PIM not working for custom genome HOT 1
- Use msigdb for pathway analysis; perform additional analysis on specific db/pathway HOT 1
- Genome.json file error with pipeliner v4.0.2 HOT 3
- RNASeq DE naming error due to "." in sample HOT 1
- Fix mmul FASTQ screen; currently not using mmul as a species HOT 6
- groups.tab: Add validation test
- exome pipeline subjobs suspend HOT 2
- Cleanup of branches / versions / Biowulf HOT 2
- Move Pipeliner to Biowulf8 HOT 2
- test
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pipeliner.