Comments (4)
you can probably get around this by adding the full=True
kwarg to the groupby method.
you can also iterate over the result of the grouping outside of bedtools as:
for line in open(grouped_bed.fn):
stuff = line.split("\t")
...
from pybedtools.
As Brent points out, full=True
does the trick for this case.
Under the hood, there's a bunch of checking to see how many fields there are in a line, if it looks like GFF or not, which fields can be converted to an integer, and so on. Then Interval objects are created with a appropriate chrom/start/stop etc attributes and 0- or 1-based coordinates.
For the general case, I'd prefer not to support arbitrarily-formatted (though created by BEDTools) formats like this. But you could write a converter -- if full=True
wasn't an option, then you could modify the last two lines of your example to yield lines that are in fact BED-like:
def reformat(x):
for line in open(x.fn):
fields = line.strip().split('\t')
yield pybedtools.create_interval_from_list([
fields[1], fields[2], fields[3], fields[0], fields[4], fields[5]])
for line in reformat(grouped_bed):
print line.fields
from pybedtools.
Thanks so much Brent and Ryan, I didn't think to use .fn
(I guess it's technically only there temporarily right?) and totally missed full=True
. Very helpful, thanks!
from pybedtools.
Yep, if you don't do anything special, .fn
will be deleted when the interpreter exits. But you could use a .saveas()
call if you want to save it for later. Alternatively, when running groupby()
(or any method that wraps BEDTools commands), use the output
kwarg to have it save the file to a specific filename.
from pybedtools.
Related Issues (20)
- pybedtools.bedtool.BedTool.save_seqs leaves open .tmp files
- Support Python 3.10 and 3.11 HOT 1
- "python setup.py bdist_wheel did not run successfully" when pip installing with python v3.11 HOT 8
- to_dataframe() creates 0th row with generic names in nucleotide_content HOT 2
- build failure under python 3.11 HOT 6
- pybedtools intersect error HOT 2
- Cannot create a BedTool object from list of regions that uses np.int64 coordinates
- remove historical py27 support HOT 1
- bedtools intersect reported incorrect interval intersection HOT 3
- Cythonizing files requires `language_level=2` to be set in cythonize() HOT 4
- pybedtools multi_bam_coverage assistance HOT 2
- "fastaFromBed" error HOT 2
- intersect with multiple -b arguments not working with -sorted HOT 1
- Unable to install pybedtools==0.9.1 in Python3.10 HOT 4
- Len modifying the Bedtools after a filter HOT 2
- Has pybedtools considered packaging bedtools? HOT 3
- how to mask gap regions for randomization? HOT 1
- Issue while doing pip install pybedtools HOT 3
- Inconsistent behaviour when using files from `pathlib.PosixPath` with BedTool functions...
- pybedtools.bedtool.Bedtool.sort()
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pybedtools.