码迷,mamicode.com
首页 > 其他好文 > 详细

合并基因表达水平(merge gene expression levels, FPKM)

时间:2016-05-26 10:11:59      阅读:190      评论:0      收藏:0      [点我收藏+]

标签:

使用tophat和cufflinks计算RNA-seq数据的表达水平时,当一个基因在一个样本中有多个表达水平时需要合并它们的表达水平。

 

This code is a solution to collapsing duplicate FPKMs for a gene.

CollapseFPKM

This code is a solution to collapsing duplicate FPKMs for a gene

Problem/Issue:

In the cufflinks output files *_genes.expr (which reports the gene-level coordinates and expression values), sometimes I get more than one row for the same gene? It‘s like in some cases the FPKM values from the transcripts corresponding to the same gene do not get summed, although the transcripts are assigned to the same gene.

 

Reasons and Solution:

The multiple FPKM problem occurs when genes have transcripts that do not overlap with any other transcripts in the gene. For example, this occurs in the ENSG00000125388 gene from ENSEMBL/hg19. We are aware of this issue and will eventually change the behavior, but for now a simple solution is just to sum the FPKMs since the gene FPKMs are just the sum of the transcript FPKMs anyways.

 

网址:

https://sourceforge.net/projects/collapsefpkm/files/?source=navbar

合并基因表达水平(merge gene expression levels, FPKM)

标签:

原文地址:http://www.cnblogs.com/emanlee/p/5529850.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!