[go: nahoru, domu]

Skip to content
This repository has been archived by the owner on Mar 28, 2024. It is now read-only.
/ copycat Public archive

Copycat is a simple script to capture and bin the read coverage across a genome from a bam file of read alignments. It relies on bedtools to get coverage for each individual nucleotide in the genome using bedtools genomecov, then bins and organizes these coverage values into 10kb bins and outputs the coverage information in a .csv format (for up…

Notifications You must be signed in to change notification settings

MariaNattestad/copycat

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

Copycat

Copycat is a simple script to capture and bin the read coverage across a genome from a bam file of read alignments.

It relies on bedtools to get coverage for each individual nucleotide in the genome using bedtools genomecov, then bins and organizes these coverage values into 10kb bins and outputs the coverage information in a .csv format (for upload to SplitThreader) and in a .seg format (for viewing the copy number profile in IGV)

Usage: copycat alignments.sorted.bam genome_file output_prefix

  • alignments.sorted.bam:

    • Bam file must be sorted.
    • Remember to filter for mapping quality first if desired: \n\tsamtools view -q -b BAM.bam > FILTERED_BAM.bam
  • genome_file:

    • The genome file should be structured as chromosome_name<tab>chromosome_size
    • For example:
chr1    249250621
chr2    243199373
  • output_prefix:
    • Prefix for all output files. Output files will be output_prefix.coverage.10kb, output_prefix.coverage.10kb.csv, output_prefix.coverage.10kb.for_IGV.seg

About

Copycat is a simple script to capture and bin the read coverage across a genome from a bam file of read alignments. It relies on bedtools to get coverage for each individual nucleotide in the genome using bedtools genomecov, then bins and organizes these coverage values into 10kb bins and outputs the coverage information in a .csv format (for up…

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages