[galaxy-user] Pilup file and counting reads in 200bp windows

Jennifer Jackson jen at bx.psu.edu
Tue Oct 4 19:39:55 EDT 2011


Hello Di,

Once the data is in pile-up format, the read identifier is lost, so 
summarizing per-read over a range is not possible.

However, you could graph the data, which would graph the number of reads 
represented per reference genome position. Do this with "Graph/Display 
Data -> Histogram", on the pileup coverage value and setting the number 
of breaks per chromosome so that they end up being each ~200 bases.

Or, going back to the BAM/SAM data, the tool "Regional Variation -> Make 
windows" can create 200 bp windows per chromosome (given an input BED3 
file - chrom, start, end). Then use this and the BAM/SAM alignments 
(converted to interval) as input for the tool "Regional Variation -> 
Feature coverage".

Hopefully one of the options will work for you,

Best,
Jen
Galaxy team

ps. Please send all questions directly to the mailing lists unless they 
contain private data/links.


On 10/4/11 3:18 PM, Di Nguyen wrote:
> Hi Jennifer,
>
> I have some Chip seq data that I can map with bwa, then I can generate a
> pileup file. My question is how do we count how many reads in a 200bp
> windows, say for Chromosome X or autosomes? Please help!
>
> My utmost appreciation,
>
> Di Nguyen
> UWashington
>
>
>

-- 
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/Support


More information about the galaxy-user mailing list