生物信息 « 小生这厢有礼了(BioFaceBook Personal Blog)

ubuntu 9.10 下安装glimmer3失败及其解决办法

(1)安装前，先修改src/Common下第26行#include <string> 为 #include <cstring>；再make，可是仍然会产生以下错误 * Make Target is all ##### Making Directory /usr/local/glimmer3.02/src/Common all ##### make[1]: 正在进入目录 `/usr/local/glimmer3.02/src/Common’ @@@@@@@@@@@@@@@@@@@ delcher.cc @@@@@@@@@@@@@@@@@@@@@ @@@@@@@@@@@@@@@@@@@ fasta.cc @@@@@@@@@@@@@@@@@@@@@ @@@@@@@@@@@@@@@@@@@ gene.cc @@@@@@@@@@@@@@@@@@@@@ gene.cc: In member function ‘void PWM_t::Print(FILE*)’: gene.cc:263: warning: deprecated conversion from string constant to ‘char*’ gene.cc: In function ‘int Char_Sub(char)’: gene.cc:448: error: invalid conversion from ‘const char*’ […]

PCR引物设计的黄金法则

1.引物最好在模板cDNA的保守区内设计。 DNA序列的保守区是通过物种间相似序列的比较确定的。在NCBI上搜索不同物种的同一基因，通过序列分析软件（比如DNAman）比对（Alignment），各基因相同的序列就是该基因的保守区。

2.引物长度一般在15~30碱基之间。引物长度（primer length）常用的是18-27 bp，但不应大于38，因为过长会导致其延伸温度大于74℃，不适于Taq DNA 聚合酶进行反应。

3.引物GC含量在40%~60%之间，Tm值最好接近72℃。 GC含量（composition）过高或过低都不利于引发反应。上下游引物的GC含量不能相差太大。另外，上下游引物的Tm值（melting temperature）是寡核苷酸的解链温度，即在一定盐浓度条件下，50%寡核苷酸双链解链的温度。有效启动温度，一般高于Tm值5~10℃。若按公式Tm= 4（G C） 2（A T）估计引物的Tm值，则有效引物的Tm为55~80℃，其Tm值最好接近72℃以使复性条件最佳。

4.引物3′端要避开密码子的第3位。如扩增编码区域，引物3′端不要终止于密码子的第3位，因密码子的第3位易发生简并，会影响扩增的特异性与效率。

5.引物3′端不能选择A，最好选择T。引物3′端错配时，不同碱基引发效率存在着很大的差异，当末位的碱基为A时，即使在错配的情况下，也能有引发链的合成，而当末位链为T时，错配的引发效率大大降低，G、C错配的引发效率介于A、T之间，所以3′端最好选择T。

6. 碱基要随机分布。引物序列在模板内应当没有相似性较高，尤其是3’端相似性较高的序列，否则容易导致错误引发（False priming）。降低引物与模板相似性的一种方法是，引物中四种碱基的分布最好是随机的，不要有聚嘌呤或聚嘧啶的存在。尤其3′端不应超过3个连续的G或C，因这样会使引物在GC富集序列区错误引发。

7. 引物自身及引物之间不应存在互补序列。引物自身不应存在互补序列，否则引物自身会折叠成发夹结构（Hairpin）使引物本身复性。这种二级结构会因空间位阻而影响引物与模板的复性结合。引物自身不能有连续4个碱基的互补。两引物之间也不应具有互补性，尤其应避免3′ 端的互补重叠以防止引物二聚体（Dimer与Cross dimer）的形成。引物之间不能有连续4个碱基的互补。引物二聚体及发夹结构如果不可避免的话，应尽量使其△G值不要过高（应小于4.5kcal/mol）。否则易导致产生引物二聚体带，并且降低引物有效浓度而使PCR 反应不能正常进行。

8. 引物5′ 端和中间△G值应该相对较高，而3′ 端△G值较低。 △G值是指DNA 双链形成所需的自由能，它反映了双链结构内部碱基对的相对稳定性，△G值越大，则双链越稳定。应当选用5′ 端和中间△G值相对较高，而3′ 端△G值较低（绝对值不超过9）的引物。引物3′ 端的△G 值过高，容易在错配位点形成双链结构并引发DNA 聚合反应。（不同位置的△G值可以用Oligo 6软件进行分析）

[…]

GYRA : CCMAR Computational Cluster (NGS)

Welcome to the CCMAR Computational Cluster Facility: GYRA – gyra.ualg.pt¶

The GYRA cluster facility is administered and maintained by Cymon J. Cox of the Plant Systematics and Bioinformatics Research Group (PSB).

System:¶

The GYRA cluster facility consists of:

Frontend: 16-core 2.3GHz 32GB DELL PowerEdge R715 compute-0-0: 8-core 2.6GHz 8GB DELL PowerEdge SC1435 compute-0-1: 8-core […]

生物資訊教學( Good sources)

生物資訊教學

生物資訊是一門整合生命科學與電腦科學的跨領域研究。此Wiki由【計算生物學】與【生物資訊演算法】之修課學生，國立中正大學生物資訊實驗室同學，與指導老師黃耀廷博士所共同編寫。生物資訊之背景知識，涵蓋部份生物、遺傳、統計、與演算法之背景知識，可分以下子課題介紹：

生命科學遺傳演化生物統計生物資訊演算法 Assembly Tool: 沒有參考基因體(Reference Genome)的物種(例如，鴿子、蓮霧)，定序完必須把破碎的序列重組出一條參考基因體。 Assembly Tools: de bruijn graph assemblers: Velvet、Euler-SR、SOAPdenovo、ABySS、CLCBio Overlap-graph assemblers: MIRA、AMOS package Scaffolder: SOAPdenovo, SSPACE, Bambus, IMAGE2. Gene Prediction Gene Ontology Annotation Alignment Tool: 已經有參考基因體的物種(例如，人類、稻米)，定序完可以直接把序列對回去(Align)參考基因體。 Short read alignment: BOWTIE、BWA、Bfast、SOAPaligner、MAQ Alignment檔案處理: SAMTools、Tabix for range query、Split Bam、挑出成對序列 Local realignment for small indels: SAMTools、GATK Long read alignment: BLAT、BLAST使用範例、wwwblast安裝設定、Lastz […]

Gene Prediction

Bacteria Gene Prediction

* GeneMark and Glimmer This provides the foundation for operon predictions and promotor predictions. One way to verify the gene prediction result is to check the presence of Shine-Dalgarno sequence in fron of each gene which is a purine-rich region with a consensus AGGAGG and is located within 20 bp upstream of […]

做蛋白分析（2，3级结构）软件参考网站 UCSF Chimera

http://www.cgl.ucsf.edu/chimera/

UCSF Chimera is a highly extensible program for interactive visualization and analysis of molecular structures and related data, including density maps, supramolecular assemblies, sequence alignments, docking results, trajectories, and conformational ensembles. High-quality images and animations can be generated. Chimera includes complete documentation and several tutorials, and can be downloaded free of charge for academic, […]

tree 统计学分析Graphpad Prism

1. 序列分类和种系分析：保守序列直接在GenBank上进行BLAST比较，所有序列使用欧洲生物信息学院的ClustalW2排列，然后用RDPII9.49进行分类。序列≧98%一致性的定为同一种，用MEGA4.0进行相邻合并分析绘制种系结构图。

2. 统计学分析：使用Graphpad Prism 软件进行配对卡方检验分析。

INSTALL MicrobeDB

shenzy@shenzy-ubuntu:/usr/local/lib/perl/5.10.1/MicrobeDB/scripts$ load_genome.pl -d /home/shenzy/Downloads/unpublished_genomes/Pseudomonas_aeruginosa_LESB58/ 2012/05/15 16:01:48> Working on /home/shenzy/Downloads/unpublished_genomes/Pseudomonas_aeruginosa_LESB58/ 2012/05/15 16:01:48> Parsing file: /home/shenzy/Downloads/unpublished_genomes/Pseudomonas_aeruginosa_LESB58/Pseudomonas_aeruginoas_LESB58.gbk 2012/05/15 16:01:48> Can’t find file:/home/shenzy/Downloads/unpublished_genomes/NCBI_completegenomes.txt . GenomeProject table will not contain much organism information. 2012/05/15 16:01:48> Can’t find file:/home/shenzy/Downloads/unpublished_genomes/NCBI_completegenomes.txt . GenomeProject table will be missing a few fields of information. DBD::mysql::st execute failed: Unknown column ‘distance_calculated’ in ‘field list’ at […]

Bpipe A tool for running and managing bioinformatics pipelines

Starting from a Shell Script

In this tutorial we will develop a Bpipe pipeline script for a realistic (but simplified) analysis pipeline used for variant calling on NGS data. In this pipeline the following stages are executed:

sequence alignment using bwa sorting and indexing output files using samtools PCR duplicate removal using Picard calling variants […]

MetaVelvet: a short read assember for metagenomics

MetaVelvet: a short read assember for metagenomics Introduction

Motivation: An important step of “metagenomics” analysis is the assembly of multiple genomes from mixed sequence reads of multiple species in a microbial community. Most conventional pipelines employ a single-genome assembler with carefully optimized parameters and post-process the resulting scaffolds to correct assembly errors. Limitations of the […]

小生这厢有礼了(BioFaceBook Personal Blog)

分类

Recent Comments

链接表