CNVision

From RobsWiki

Jump to: navigation, search
Quicklinks Home Basics File formats Useful info


Contents

Welcome to CNVision

This program was designed and written by Stephan Sanders and Chris Mason from the State Lab at Yale University. It is designed to simplify all stages of predicting and analyzing CNVs, from running prediction algorithms and combining their results to visualizing the raw data and designing qPCR primers for confirmation.

If this is the first time you have used CNVision then check the CNVision basics for a quick overview.


Downloading CNVision

CNvision can be downloaded from SourceForge.net at this address:

https://sourceforge.net/projects/cnvision/files/


Installing CNVision

For directions on installing CNVision see the CNVision install page


CNV Data

BeadStudio Imports raw data and creates a new BeadStudio project
FinalReports Converts a BeadStudio project into data that CNVision can analyze


CNV Pipeline

See PipelineShows an image of the entire pipeline: CNVision Pipeline Map
StartInitiates a Logfile and Sequence tracker file, also asks how many processors to use
GenderDetermines the gender of FinalReports by looking at chrX homozygosity
QualityChecks FinalReports quality by counting the number of probes with extreme and wide logR values
ConvertConverts FinalReports into suitable input files for PennCNV and QuantiSNP
InputMakes a list of all files made during the Convert function and prepares a list of files for PennCNV and a batch file for QuantiSNP
GNOSISRuns the GNOSIS CNV detection tool on all files with 'FinalReport' in their name
GNOSIS oldRuns a version of GNOSIS CNV detection tool that modifies LogR values on chrX and chrY for both sexes, this is not necessary for newer Illumina chips (1M Duo, 370 Quad, Omni)
RF GNOSISReformats GNOSIS output to allow merging
PennCNVRuns the PennCNV CNV detection tool on all files in the list created by Input
RF PennCNVReformats PennCNV output to allow merging
QuantiSNPRuns the QuantiSNP CNV detection tool on all files in the batch file created by Input
RF QuantiSNPReformats QuantiSNP output to allow merging
MergeMerges files in the Merge input format
RareCompares merged output to a list of common loci in the Annotation Folder
BadCombines lists of samples that failed quality control into a single list
PedfileCreates a modified pedfile without samples in the list generated by Bad
DenovoCompares children to parents using a pedfile (with 'pedfile' in the filename)
SummaryConverts the Denovo output into a readable format and makes a UCSC genome browser file
AnnotateAnnotates the Summary output for genes, common variants and regions of interest
HomodelFinds homozygous deletions in FinalReports
RF HomodelReformats Homodel output to the Merge input format
AnnotateAnnotates the Homodel output for genes, common variants and regions of interest
CleanRemoves any unnecessary files from the folder

Quality Control

QualityChecks FinalReports quality by counting the number of probes with extreme and wide logR values
FamilyChecks the family relationships of all FinalReports in a pedfile using PLINK
OrderChecks that all FinalReports are in the same order
StrandChecks that all FinalReports are from the same strand
OverlapLooks for overlap between regions or CNVs in a file


CNV Analysis

De Novo 2A different method of de novo CNV prediction that uses the FinalReports and a pedfile to work out whether a list CNVs are de novo or inherited
LargeAn algorithm for joining together large CNVs that have been made into smaller predictions with gaps
RecurrentWorks out the relative frequency of CNVs from two files of CNVs
Recurrent thresSame as Recurrent, but filters the input by CNV type, number of algorithms and number of probes
FrequencyDetermines the frequency of CNVs/regions across all samples
eTaqmanExamines all FinalReports for evidence of CNVs at specified co-ordinates
LociReduces a list of regions/CNVs to non-overlapping loci
CompareCompares the output of two algorithms/chips run on the same samples
ProportionCalculates the number of CNVs and individuals meeting specified criteria to calculate CNV burden
TemplateCreates a template for running Multiproportion
MultiproportionSimilar to Proportion, but can do numerous analyses. Use the Template function above to create a template specifiying the analyses to perform, add the analyses then run this function.


Annotation

AnnotateAnnotates a list of regions (chr, start, stop) against a series of annotation files (eg genes); this function allows numerous analyses and can be customised using the Template function below
TemplateMakes a custom annotation template of all the files in the Annotation Folder
PedfileAnnotates a list of regions (chr, start, stop) against a pedfile to show family, gender and phenotype
SamplesAnnotates a list of regions against a list of samples
GenesFinds the chr, position, number of exons and gene name of each RefSeq Gene Symbol (eg NRXN1)
Genes2Similar to Genes, but performs the analysis of the internet and is slower
SNPcountWorks out the number of probes in a FinalReports file within each region/CNV
dbSNPWorks out the number of common SNPs present in dbSNP within each region/CNV
SNPUses the internet to look up SNP 'rs' IDs in dbSNP and return chr and position

Visualization

CNVsMakes a PDF of the LogR and BAF values for CNVs per family (SOR plot)
BEDgraphMakes a UCSC genome browser custom track of Frequency files
WiggleMakes a UCSC genome browser custom track of large Frequency files
BEDfileMakes a UCSC genome browser custom track of CNV files
FinalReportMakes a UCSC genome browser custom track of a FinalReport


FinalReport Analysis

SampleIDQuickly finds the sample ID and chip type from FinalReports
GenderLooks for 'FinalReport' and determines sample ID, chip type and sex
AncestryUses 400 ancestry informative markers (AIMs) to determine connitent of origin
Mean LogRCalculates the Mean, median and stdev in LogR of each chromosome
ConsensusMakes a list of the probes common to different chip types in FinalReports
PruneRemoves probes that are not present in the consensus list from Consensus out of FinalReports
PLINKConverts FinalReports into PLINK input format (.ped and .map)
HomodelFinds homozygous deletions in FinalReports
UPDCalculates the percentage homozygosity per chromosome to detect uniparental disomy


File Manipulation

ListMakes a list of every file and folder
CopyCopies files in a text file to a new folder
MoveMoves files in a text file to a new folder
vLookupLooks up terms in a reference file to add a new column
SplitLooks for differences in a specified column and splits the file accordingly
CombineCombines multiple files that have columns in the same order
SortSorts the file by up to three columns
ColumnsChanges the order of columns in a file
Trim topRemoves lines from the top of a file
Trim bottomRemoves lines from the bottom of a file (useful for making a testfile)
DuplicatesRemoves duplicate lines in a file
DifferencesFind differences between two files that should be identical


Other Tools

GC contentCalculates the GC content for a list of regions
Get DNALooks up the DNA sequence for a list of regions
PrimersDesigns qPCR or PCR primers for a list of regions

Advanced Users

The graphical user interface (CNVision.jar) is not necessary for running the functions. The perl script Combined_CNVv1.73.pl can be used alone to perform all these functions using command prompt (Windows) or terminal (Mac and Linux). On the pages showing how a specific function is performed in CNVision, the command line description is also given.

Personal tools