<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>小生这厢有礼了(BioFaceBook Personal Blog) &#187; My project</title>
	<atom:link href="https://www.biofacebook.com/?cat=42&#038;feed=rss2" rel="self" type="application/rss+xml" />
	<link>https://www.biofacebook.com</link>
	<description>记录生物信息学点滴足迹（NGS,Genome,Meta,Linux)</description>
	<lastBuildDate>Sun, 23 Aug 2020 03:28:53 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.41</generator>
	<item>
		<title>Amber的分子动力学模拟 (转贴）</title>
		<link>https://www.biofacebook.com/?p=944</link>
		<comments>https://www.biofacebook.com/?p=944#comments</comments>
		<pubDate>Thu, 16 Oct 2014 09:23:45 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[生物信息]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=944</guid>
		<description><![CDATA[<p style="color: #444444;">amber进行MD的步骤如下：PDB文件需要去掉结晶水、氢 和金属离子（非必须）。 网上教程：http://enzyme.fbb.msu.ru/Tutorials/</p> <p style="color: #444444;">金属酶的模拟：http://ambermd.org/tutorials/advanced/tutorial1_orig/ 蛋白和配体的模拟：</p> <p style="color: #444444;">1：分子系统的准备</p> <p style="color: #444444;"> 命令如下： tleap //进入leap source leaprc.ff99SB //加载蛋白力场 source leaprc.gaff //如果有小分子加载小分子力场 loadamberparams lig.frcmod //特殊情况还要加载其他特殊的力场参数 mol = loadpdb protein.pdb //定义mol变量并加载蛋白分子 bond mol.1.SG mol.6.SG //如果要二硫键。 check mol //检查蛋白 saveamberparm mol protein_vac.top protein_vac.crd //生成真空拓扑、坐标文件 solvatebox mol TIP3PBOX 10.0 //此为方形，八面体用solvateoct charge mol //检测系统电荷 </p> <p style="color: [...]]]></description>
				<content:encoded><![CDATA[<p style="color: #444444;"><strong>amber进行MD的步骤如下：PDB文件需要去掉结晶水、氢 和金属离子（非必须）。<br />
网上教程：http://enzyme.fbb.msu.ru/Tutorials/</strong></p>
<p style="color: #444444;"><strong>金属酶的模拟：http://ambermd.org/tutorials/advanced/tutorial1_orig/<br />
蛋白和配体的模拟：</strong></p>
<p style="color: #444444;"><strong>1：分子系统的准备</strong></p>
<p style="color: #444444;"><strong>   命令如下：<br />
tleap                         //进入leap<br />
source  leaprc.ff99SB         //加载蛋白力场<br />
source  leaprc.gaff           //如果有小分子加载小分子力场<br />
loadamberparams lig.frcmod //特殊情况还要加载其他特殊的力场参数<br />
mol = loadpdb  protein.pdb    //定义mol变量并加载蛋白分子<br />
bond mol.1.SG mol.6.SG        //如果要二硫键。<br />
check mol                     //检查蛋白<br />
saveamberparm mol protein_vac.top  protein_vac.crd  //生成真空拓扑、坐标文件<br />
solvatebox mol  TIP3PBOX 10.0     //此为方形，八面体用solvateoct<br />
charge mol                     //检测系统电荷<br />
</strong></p>
<p style="color: #444444;"><span style="color: #444444;">    addions mol Na+  1             //加入+电荷</span><br style="color: #444444;" /><span style="color: #444444;">    addions mol CL-  1             //加入-电荷</span><br style="color: #444444;" /><span style="color: #444444;">    saveamberparm mol protein_sol.top  protein_sol.crd  //生成拓扑、坐标文件</span><br style="color: #444444;" /><span style="color: #444444;">    quit                           //退出 leap</span><br style="color: #444444;" /><span style="color: #444444;">    ambpdb -p protein_sol.top &lt;protein_sol.crd&gt;   protein_sol.pdb //生成体系pdb便于查看。</span></p>
<p style="color: #444444;"><strong>说明：<br />
</strong><strong>        &lt;1&gt;</strong>PDB格式文件里面规定了二硫键，在amber生成参数文件时候，要用bond命令将二硫键固定。<br />
SSBOND   1 CME A  381    CME A  381<br />
SSBOND   2 CME A  417    CME A  417<br />
<strong>&lt;2&gt; 特殊配体参数文件的制备<br />
antecharmber -i lig.pdb -fi pdb -o lig.mol2 -fo mol2 -c bcc（或者gas) -s 2<br />
parmchk -i lig.mol2 -f mol2 -o lig.frcmod<br />
source leaprc.gaff<br />
LIG = loadmol2 lig.mol2<br />
</strong></p>
<p style="color: #444444;"><span style="color: #444444;">          check LIG</span><br style="color: #444444;" /><span style="color: #444444;">           loadamberparm lig.frcmod</span><br style="color: #444444;" /><span style="color: #444444;">           check LIG</span><br style="color: #444444;" /><span style="color: #444444;">           saveamberparm LIG lig.top lig.crd</span><br style="color: #444444;" /><span style="color: #444444;">          &lt;3&gt; 特殊配体参数文件的制备</span><br style="color: #444444;" /><span style="color: #444444;">          amber 加力场时如出现   Created a new atom named : .R&lt;ILE 166&gt; .A&lt;CD1 14&gt;</span><br style="color: #444444;" /><span style="color: #444444;">          表示amber可能不识别这些原子。可以在pdb中删除让amber自动添加。</span><strong style="color: #444444;"><br />
</strong><span style="color: #444444;">          </span><strong style="color: #444444;">&lt;4&gt; Amber 力场<br />
</strong><span style="color: #444444;"> </span><strong style="color: #444444;">ff10力场(parm10.dat)：</strong><span style="color: #444444;">对ff99的各种参数补丁的集合，相当于parm99.dat+frcmod.ff03+bsc0+chi.OL3+新的离子参数+原子和残基名的修改以顺应PDB format version 3。这是目前最好的amber力场。</span><br style="color: #444444;" /><strong style="color: #444444;">ff03.r1力场(parm99.dat+frcmod.ff03)：</strong><span style="color: #444444;">ff99力场的修改版。获取电荷时通过连续介电模型表现溶剂可极化效应，修改了蛋白phi、psi骨架参数，减少了对螺旋构象的偏爱。核酸参数相对于ff99没变。ff03.r1与amber9中的ff03略有不同，那时仍用的是ff94的方法得来的碳、氮端基原子电荷，如果仍想用那时代的ff03就调用oldff/leaprc.ff03.</span><br style="color: #444444;" /><strong style="color: #444444;">ff03ua力场(parm99.dat+frcmod.ff03+frcmod.ff03ua)：</strong><span style="color: #444444;">ff03力场的united-atom版本，侧链的氢原子被united了，骨架上的氢原子和芳香环上的氢原子仍被保留。由于骨架还是全原子故骨架势参数没变，侧链上的参数因用了united故重新拟合。核酸参数完全没变，且还是全原子。</span><br style="color: #444444;" /><strong style="color: #444444;">ff02力场(parm99.dat+frcmod.ff02pol.r1)：</strong><span style="color: #444444;">ff99力场的可极化版，给原子上增加了可极化的偶极子。frcmod.ff02pol.r1是对原ff02的扭转参数的修正。</span><br style="color: #444444;" /><strong style="color: #444444;">ff02EP力场(parm99EP.dat+frcmod.ff02pol.r1)：</strong><span style="color: #444444;">ff02力场基础上给诸如氧、氮、硫原子增加了偏离原子中心的点电荷以表现孤对电子效应。据称比ff02稍好点。</span><br style="color: #444444;" /><strong style="color: #444444;">ff99力场(parm99.dat)：</strong><span style="color: #444444;">大部分参数来自ff94力场，修改了许多扭转角的参数。甘氨酸的骨架参数有问题，螺旋和延展构象的平衡性不对。而对于DNA，ff99长时间模拟中亚稳态占统治地位，即alpha和gamma二面角倾向于分别为gauche+和trans状态。虽然在RNA中也有这问题，但不严重。ff99的这些毛病在ff94里也有。</span><br style="color: #444444;" /><strong style="color: #444444;">ff99SB力场(parm99.dat+frcmod.ff99SB)：</strong><span style="color: #444444;">对ff99的蛋白二面角参数进行修正，二级结构间分布的比例得到了改善，也解决了甘氨酸骨架参数问题。</span><br style="color: #444444;" /><strong style="color: #444444;">bsc0(frcmod.parmbsc0)：</strong><span style="color: #444444;">解决上述ff99在核酸模拟问题上的补丁，同时还改进了RNA的糖苷的gamma二面角扭转势。可参考http://mmb.pcb.ub.es/PARMBSC0。</span><br style="color: #444444;" /><strong style="color: #444444;">ff99SB+bsc0力场：</strong><span style="color: #444444;">把bsc0补丁用到ff99SB上，相对于ff99同时增进对蛋白和核酸的效果。这个组合使gamma二面角过分偏离了trans型。如果初始结构有很多gamma角为trans的情况，还是用ff99比较好。</span><br style="color: #444444;" /><strong style="color: #444444;">ff99SBildn(frcmod.ff99SBildn)：</strong><span style="color: #444444;">在ff99SB基础上修改氨基酸侧链参数的补丁。</span><br style="color: #444444;" /><strong style="color: #444444;">ff99SBnmr(frcmod.ff99SBnmr)：</strong><span style="color: #444444;">在ff99SB基础上修改骨架扭转项参数以更符合NMR数据的补丁。</span><br style="color: #444444;" /><strong style="color: #444444;">ff98力场(parm98.dat)：</strong><span style="color: #444444;">对ff94改进了糖苷的扭转角参数。</span><br style="color: #444444;" /><strong style="color: #444444;">ff96力场(parm96.dat)：</strong><span style="color: #444444;">与ff94扭转角不同，算出来的能量更接近量化结果。来自Beachy et al，由于构象有明显偏向beta等问题，使用不广泛。</span><br style="color: #444444;" /><strong style="color: #444444;">ff94力场(parm94.dat)：</strong><span style="color: #444444;">来自Cornell, Kollman et al。适合溶剂环境。电荷由RESP HF/6-31G*获得。</span><br style="color: #444444;" /><strong style="color: #444444;">ff86力场(parm91X.dat)：</strong><span style="color: #444444;">将ff84扩展为全原子力场。和ff84一样对氢键也是用Lennard-Jones 10-12势，故如果想在sander里用ff84/86，得重新带着-DHAS_10_12选项编译。之所以相应的文件叫parm91X是因为对原始ff86做了一些修正。（parm91X.dat是parm91.dat的补完版，加入了一些非键项，但非键项比如Mg、I等的参数都没调好，只是近似。）</span><br style="color: #444444;" /><strong style="color: #444444;">ff84(parm91X.ua.dat)：</strong><span style="color: #444444;">最早的AMBER力场，用于模拟核酸和蛋白质的联合原子力场。不推荐使用，但在真空或者距离依赖的介电常数下模拟还有用。</span><br style="color: #444444;" /><strong style="color: #444444;">parmAM1和parmPM3力场(parmAM1.dat/parmPM3.dat)：</strong><span style="color: #444444;">用这个参数对蛋白质优化可以得出与AM1/PM3相同的优化结果。如今已没什么价值。</span><br style="color: #444444;" /><strong style="color: #444444;">GAFF力场(gaff.dat)=Generation Amber Force Field：</strong><span style="color: #444444;">普适型有机小分子力场，函数形式和AMBER力场相同，与AMBER力场完全兼容。</span><br style="color: #444444;" /><strong style="color: #444444;">GLYCAM-06力场(GLYCAM_06g.dat)：</strong><span style="color: #444444;">对以前GLYCAM力场做了改进，并且纳入了一小部分脂类的参数。</span><br style="color: #444444;" /><strong style="color: #444444;">GLYCAM-04EP力场(GLYCAM04EP.dat)：</strong><span style="color: #444444;">将GLYCAM04扩展到可用于TIP5P模型下的模拟。给氧加上非原子中心点电荷表现孤对电子效应。</span><br style="color: #444444;" /><strong style="color: #444444;">GLYCAM-04力场(GLYCAM04.dat)=glycans and glycoconjugates in AMBER：</strong><span style="color: #444444;">专用于糖的模拟，和AMBER完全兼容，可一起用于糖蛋白的模拟。官网：http://glycam.ccrc.uga.edu/ccrc/index.jsp</span><br style="color: #444444;" /><span style="color: #444444;">另外，Amber程序还支持AMOEBA力场，也可以通过自带的CHAMBER工具来支持CHARMM力场，这里就不提了。</span></p>
<p style="color: #444444;"><strong>2：溶剂优化<br />
</strong><strong>模拟的参数文件如下：min1.in<br />
</strong><strong>protein: initial minimisation solvent + ions<br />
&amp;cntrl<br />
imin = 1,                 //任务是优化， 0为分子动力学<br />
maxcyc = 1000,      //优化步数<br />
ncyc = 500,             // 前500为最陡下降 后500为共轭梯度<br />
ntb = 1,                  // 周期边界条件 0 不采用  1 定容  2 定压<br />
</strong></p>
<p style="color: #444444;"><span style="color: #444444;">ntr = 1,                   //优化时需要一些约束原子  -ref</span><br style="color: #444444;" /><span style="color: #444444;">cut = 10                  //非键相互作用阈值为 10唉</span><br style="color: #444444;" /><span style="color: #444444;">/</span><br style="color: #444444;" /><strong style="color: #444444;">Hold the protein fixed   //约束说明<br />
500.0                            //作用在肽键上的力<br />
RES 1 194                    //限制的残基序号<br />
END<br />
END<br />
</strong><span style="color: #444444;">    红色部分表示限制蛋白部分。其中500.0单位是kcal/mol，表示作用在肽链上使其不动的力</span><strong style="color: #444444;">。“RES 1 194”表示肽链残基数目，</strong><span style="color: #444444;">因为我们学习使用的protein有194个残基。</span><br style="color: #444444;" /><span style="color: #444444;">    模拟命令如下：</span><br style="color: #444444;" /><strong style="color: #444444;">sander –O –i min1.in –o min1.out –p protein_sol.top –c protein_sol.crd –r protein_min1.rst –ref protein_sol.crd &amp;<br />
</strong><strong style="color: #444444;">3:蛋白的优化 min2.in<br />
protein: initial minimisation whole system<br />
&amp;cntrl<br />
imin = 1,<br />
maxcyc = 2500,<br />
ncyc = 1000,<br />
ntb = 1,<br />
ntr = 0,<br />
cut = 10<br />
/<br />
命令如下：<br />
</strong><strong style="color: #444444;">sander –O –i min2.in –o min2.out –p protein_sol.top –c protein_min1.rst –r protein_min2.rst &amp;<br />
</strong><span style="color: #444444;">4: 有限分子动力学模拟</span><br style="color: #444444;" /><span style="color: #444444;">       在这个步骤中，我们将主要目的是对特定的原子使用作用力使其能量优化。我们要优化溶剂环境，至少需要10ps，我们将使用20ps用来优化我们上两步制作的分子系统的周期性边界的溶剂环境。</span><br style="color: #444444;" /><span style="color: #444444;"> </span><strong style="color: #444444;">命令配置文件md1.in如下：<br />
</strong><span style="color: #444444;">protein: 20ps MD with res on protein</span><br style="color: #444444;" /><span style="color: #444444;">&amp;cntrl</span><br style="color: #444444;" /><strong style="color: #444444;">imin = 0,  irest = 0, </strong><span style="color: #444444;">  </span><strong style="color: #444444;">ntx = 1,</strong><span style="color: #444444;">           //表示模拟过程为分子动力学，不是能量最优化。</span><br style="color: #444444;" /><strong style="color: #444444;">nstlim = 10000, dt = 0.002,           </strong><span style="color: #444444;"> //表示计算的步数。</span><br style="color: #444444;" /><strong style="color: #444444;">ntc = 2,</strong><span style="color: #444444;">             //1表示不实用使用，2表示氢键将被计算，3表示所有键都将被计算在内。</span><br style="color: #444444;" /><strong style="color: #444444;">ntf = 2,<br />
</strong><strong style="color: #444444;">cut = 10,</strong><br style="color: #444444;" /><strong style="color: #444444;">ntb = 1,  </strong><span style="color: #444444;">                // 表示分子动力学过程保持体积固定。</span><br style="color: #444444;" /><strong style="color: #444444;">ntr = 1,</strong><br style="color: #444444;" /><strong style="color: #444444;">tautp = 0.1,           </strong><span style="color: #444444;">//热浴时间常数，缺省为1.0。小的时间常数可以得到较好的耦联。</span><br style="color: #444444;" /><strong style="color: #444444;">ntpr = 100,<br />
</strong><strong style="color: #444444;">ntwx = 500,<br />
ntwr = 1000<br />
ntt = 3, </strong><span style="color: #444444;">                     //温度转变控制，3表示使用兰格氏动力学</span><br style="color: #444444;" /><strong style="color: #444444;">gamma_ln = 1.0, </strong><span style="color: #444444;">   //表示当ntt＝3时的碰撞频率，单位为ps-1（请参考AMBER手册）</span><strong style="color: #444444;"> </strong><span style="color: #444444;"> </span><br style="color: #444444;" /><strong style="color: #444444;">tempi = 0.0,   </strong><span style="color: #444444;">        //系统开始时的温度。</span><br style="color: #444444;" /><span style="color: #444444;"> </span><strong style="color: #444444;">temp0 = 300.0, </strong><span style="color: #444444;">    //表示最后系统到达并保持的温度，单位为K。 </span><br style="color: #444444;" /><span style="color: #444444;"> /</span><br style="color: #444444;" /><span style="color: #444444;">keep protein fixed with weak restraints</span><br style="color: #444444;" /><span style="color: #444444;">10.0</span><br style="color: #444444;" /><span style="color: #444444;">RES 1 194   //蛋白的残基数</span><br style="color: #444444;" /><span style="color: #444444;">END</span><br style="color: #444444;" /><span style="color: #444444;">END</span><br style="color: #444444;" /><span style="color: #444444;"> </span><strong style="color: #444444;">   我们将使用一个较小的作用力，10kcal/mol。在分子动力学中，当ntr＝1时，作用力只需要5-10kcal/mol（我们需要引用一个坐标文件做分子动力学过程的比较，我们需要使用&#8221;-ref&#8221;参数）。太大的作用力同时使用Shake算法和2fs步长将使整个系统变得不稳定，因为大的作用力使系统中的原子产生大频率的振动，模拟过程并步需要。</p>
<p></strong><span style="color: #444444;">运行命令如下：</span><br style="color: #444444;" /><br style="color: #444444;" /><strong style="color: #444444;">sander –O –i md1.in –o md1.out –p protein_sol.top –c protein_min2.rst –r protein_md1.rst –x protein_md1.mdcrd –ref protein_min2.rst –inf md1.info&amp;<br />
5、生成相分子动力学模拟<br />
</strong><span style="color: #444444;">    这一步进行整个系统的分子动力学模拟，而不对某些特定原子位置进行限制。配置文件</span><strong style="color: #444444;">md2.in</strong><span style="color: #444444;">如下：</span><br style="color: #444444;" /><span style="color: #444444;">protein: 250ps MD</span><br style="color: #444444;" /><span style="color: #444444;">&amp;cntrl</span><br style="color: #444444;" /><span style="color: #444444;">imin = 0, irest = 1, ntx = 7,</span><br style="color: #444444;" /><span style="color: #444444;">ntb = 2, pres0 = 1.0, ntp = 1,</span><br style="color: #444444;" /><span style="color: #444444;">taup = 2.0,</span><br style="color: #444444;" /><span style="color: #444444;">cut = 10, ntr = 0,</span><br style="color: #444444;" /><span style="color: #444444;">ntc = 2, ntf = 2,</span><br style="color: #444444;" /><span style="color: #444444;">tempi = 300.0, temp0 = 300.0,</span><br style="color: #444444;" /><span style="color: #444444;">ntt = 3, gamma_ln = 1.0,</span><br style="color: #444444;" /><span style="color: #444444;">nstlim = 125000, dt = 0.002,</span><br style="color: #444444;" /><span style="color: #444444;">ntpr = 100, ntwx = 500, ntwr = 1000</span><br style="color: #444444;" /><span style="color: #444444;">/</span><br style="color: #444444;" /><span style="color: #444444;">使用以下命令进行MD：</span><br style="color: #444444;" /><strong style="color: #444444;">sander –O –i md2.in –o md2.out –p protein.top –c protein_md1.rst – r protein_md2.rst –x protein_md2.mdcrd –ref protein_md1.rst –inf md2.info &amp;<br />
6:Amber模拟数据的分析<br />
</strong><span style="color: #444444;">    模拟完成后，接下来要对得到的数据进行分析。主要数据文件</span><strong style="color: #444444;">.out</strong><span style="color: #444444;">文件，包含系统能量、温度，压力等等；.mdcrd文件，是分子动力学轨迹文件，可以求系统蛋白的RMSD，回转半径等等。</span><strong style="color: #444444;">    </strong></p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=944</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>R pheatmap</title>
		<link>https://www.biofacebook.com/?p=867</link>
		<comments>https://www.biofacebook.com/?p=867#comments</comments>
		<pubDate>Wed, 12 Feb 2014 04:04:41 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[R语言]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=867</guid>
		<description><![CDATA[<p>&#62; library(caTools); &#62; library(bitops); &#62; library(grid); &#62; data=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.csv&#8221;) &#62; data=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.2.csv&#8221;) &#62; View(data) &#62; data=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.2.csv&#8221;,sep=&#8221;\t&#8221;) &#62; View(data) &#62; row.names(data) &#60;- data$X.OTU.ID; &#62; View(data) &#62; data_matrix&#60;-data[,2:15] &#62; View(data_matrix) &#62; data_matrix&#60;-data[,2:14] &#62; View(data_matrix) &#62; View(data) &#62; data_matrix&#60;-data[,1:14] &#62; View(data_matrix) &#62; library(pheatmap) &#62; data_matrix[is.na(data_matrix)]&#60;-1 &#62; View(data_matrix) &#62; data_log10&#60;-log10(data_matrix) &#62; View(data_log10) &#62; data_log2&#60;-log2(data_matrix) &#62; View(data_log2) &#62; pheatmap(data_log2,fontsize=9, fontsize_row=6) &#62; pheatmap(data_log2, [...]]]></description>
				<content:encoded><![CDATA[<p>&gt; library(caTools);<br />
&gt; library(bitops);<br />
&gt; library(grid);<br />
&gt; data=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.csv&#8221;)<br />
&gt; data=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.2.csv&#8221;)<br />
&gt; View(data)<br />
&gt; data=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.2.csv&#8221;,sep=&#8221;\t&#8221;)<br />
&gt; View(data)<br />
&gt; row.names(data) &lt;- data$X.OTU.ID;<br />
&gt; View(data)<br />
&gt; data_matrix&lt;-data[,2:15]<br />
&gt; View(data_matrix)<br />
&gt; data_matrix&lt;-data[,2:14]<br />
&gt; View(data_matrix)<br />
&gt; View(data)<br />
&gt; data_matrix&lt;-data[,1:14]<br />
&gt; View(data_matrix)<br />
&gt; library(pheatmap)<br />
&gt; data_matrix[is.na(data_matrix)]&lt;-1<br />
&gt; View(data_matrix)<br />
&gt; data_log10&lt;-log10(data_matrix)<br />
&gt; View(data_log10)<br />
&gt; data_log2&lt;-log2(data_matrix)<br />
&gt; View(data_log2)<br />
&gt; pheatmap(data_log2,fontsize=9, fontsize_row=6)<br />
&gt; pheatmap(data_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;yellow&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6)<br />
&gt; pheatmap(data_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;blue&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6)<br />
&gt; help(pheatmap)<br />
&gt; pheatmap(data_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;blue&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6,cluster_rows=FALSE, cluster_cols=FALSE)<br />
&gt; pheatmap(data_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;blue&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6)<br />
&gt; data2=read.csv(/home/shenzy/Desktop/R/Bac.heatmap1.3_GZ.csv)<br />
Error: unexpected &#8216;/&#8217; in &#8220;data2=read.csv(/&#8221;<br />
&gt; data2=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.3_GZ.csv&#8221;,sep=&#8221;\t&#8221;)<br />
&gt; View(data2)<br />
&gt; row.names(data2) &lt;- data2$X.OTU.ID;<br />
&gt; data3=read.csv(&#8220;/home/shenzy/Desktop/R/Bac.heatmap1.4_SH.csv&#8221;,sep=&#8221;\t&#8221;)<br />
&gt; row.names(data3) &lt;- data3$X.OTU.ID;<br />
&gt; View(data3)<br />
&gt; data2_matrix&lt;-data2[,1:7]<br />
&gt; View(data2_matrix)<br />
&gt; data3_matrix&lt;-data3[,1:7]<br />
&gt; View(data3_matrix)<br />
&gt; data2_matrix[is.na(data2_matrix)]&lt;-1<br />
&gt; data3_matrix[is.na(data3_matrix)]&lt;-1<br />
&gt; data2_log2&lt;-log2(data2_matrix)<br />
&gt; data3_log2&lt;-log2(data3_matrix)<br />
&gt; pheatmap(data2_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;blue&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6,cluster_rows=FALSE, cluster_cols=FALSE)<br />
&gt; pheatmap(data2_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;blue&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6)<br />
&gt; pheatmap(data3_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;blue&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6,cluster_rows=FALSE, cluster_cols=FALSE)<br />
&gt; pheatmap(data3_log2, color = colorRampPalette(c(&#8220;white&#8221;, &#8220;blue&#8221;, &#8220;firebrick3&#8243;))(50), fontsize=9, fontsize_row=6)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2014/02/BACTERIA_1.21.png"><img class="aligncenter size-full wp-image-869" title="BACTERIA_1.2" src="http://www.biofacebook.com/wp-content/uploads/2014/02/BACTERIA_1.21.png" alt="" width="550" height="450" /></a></p>
<p>&nbsp;</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2014/02/BACTERIA_1.2cluster.png"><img class="aligncenter size-full wp-image-870" title="BACTERIA_1.2cluster" src="http://www.biofacebook.com/wp-content/uploads/2014/02/BACTERIA_1.2cluster.png" alt="" width="550" height="450" /></a></p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=867</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>graphlan a good tool for draw circle picture with tree in it</title>
		<link>https://www.biofacebook.com/?p=858</link>
		<comments>https://www.biofacebook.com/?p=858#comments</comments>
		<pubDate>Fri, 24 Jan 2014 06:54:53 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=858</guid>
		<description><![CDATA[<p></p> <p></p> <p>#!/bin/sh graphlan_annotate.py hmptree.xml hmptree.annot.xml &#8211;annot annot.txt graphlan.py hmptree.annot.xml hmptree.png &#8211;dpi 150 &#8211;size 14</p> <p>&#160;</p> <p>0.9.5.tar</p> <p>&#160;</p> ]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.biofacebook.com/wp-content/uploads/2014/01/hmptree.png"><img class="aligncenter size-large wp-image-861" title="hmptree" src="http://www.biofacebook.com/wp-content/uploads/2014/01/hmptree-944x1024.png" alt="" width="640" height="694" /></a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2014/01/BacML1.9.hmptree.png"><img class="aligncenter size-large wp-image-859" title="BacML1.9.hmptree" src="http://www.biofacebook.com/wp-content/uploads/2014/01/BacML1.9.hmptree-961x1024.png" alt="" width="640" height="681" /></a></p>
<p>#!/bin/sh<br />
graphlan_annotate.py hmptree.xml hmptree.annot.xml &#8211;annot annot.txt<br />
graphlan.py hmptree.annot.xml hmptree.png &#8211;dpi 150 &#8211;size 14</p>
<p>&nbsp;</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2014/01/0.9.5.tar.gz">0.9.5.tar</a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=858</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Batch download protein sequences from CMR (comprehensive microbial resource)</title>
		<link>https://www.biofacebook.com/?p=726</link>
		<comments>https://www.biofacebook.com/?p=726#comments</comments>
		<pubDate>Thu, 28 Feb 2013 08:11:00 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[服务器管理]]></category>
		<category><![CDATA[生物信息]]></category>
		<category><![CDATA[database]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=726</guid>
		<description><![CDATA[<p>NCBI 有时批量下载的protein sequence会有不一致时，可以从以下资源数据库下载（eg, eth195)</p> <p>http://cmr.jcvi.org/cgi-bin/CMR/shared/MakeFrontPages.cgi?page=batchdownload</p> <p>&#160;</p> ]]></description>
				<content:encoded><![CDATA[<p>NCBI 有时批量下载的protein sequence会有不一致时，可以从以下资源数据库下载（eg, eth195)</p>
<p>http://cmr.jcvi.org/cgi-bin/CMR/shared/MakeFrontPages.cgi?page=batchdownload</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=726</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Circos for comparative genomes</title>
		<link>https://www.biofacebook.com/?p=697</link>
		<comments>https://www.biofacebook.com/?p=697#comments</comments>
		<pubDate>Tue, 15 Jan 2013 08:03:30 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[Circos]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=697</guid>
		<description><![CDATA[<p>Circos: nucmer &#8211;prefix=refBAV1_qryVScds Dehalococcoides_BAV1.fasta Dehalococcoides_VS.cds.fasta show-tiling -i 80 -c refBAV1_qryVScds.delta &#62; refBAV1_qryVScds.tiling awk -F &#8221; &#8221; &#8216;{print &#8220;chr1&#8243; &#8220;\t&#8221; $1 &#8220;\t&#8221; $2}&#8217; refBAV1_qryVScds.tiling &#62; Dehalococcoidessp.VScds.gene_tableBAV1.txt</p> <p>shenzy@shenzy-ubuntu:/winxp_disk2/shenzy/circos/circos-tutorials-0.62/tutorials/8/1$ perl ../../../../circos-0.62-1/bin/circos -conf circos.conf</p> <p>shenzy@shenzy-ubuntu:/winxp_disk2/shenzy/circos/circos-tutorials-0.62/tutorials/8/1$ ll total 16001 -rwxrwxrwx 1 root root 528519 2012-12-27 16:11 circos1.png -rwxrwxrwx 1 root root 1065925 2012-12-27 14:33 circos2.png -rwxrwxrwx 1 root root 1230394 [...]]]></description>
				<content:encoded><![CDATA[<p>Circos:<br />
nucmer &#8211;prefix=refBAV1_qryVScds Dehalococcoides_BAV1.fasta Dehalococcoides_VS.cds.fasta<br />
show-tiling -i 80 -c refBAV1_qryVScds.delta &gt; refBAV1_qryVScds.tiling<br />
awk -F &#8221; &#8221; &#8216;{print &#8220;chr1&#8243; &#8220;\t&#8221; $1 &#8220;\t&#8221; $2}&#8217; refBAV1_qryVScds.tiling &gt; Dehalococcoidessp.VScds.gene_tableBAV1.txt</p>
<p>shenzy@shenzy-ubuntu:/winxp_disk2/shenzy/circos/circos-tutorials-0.62/tutorials/8/1$ perl ../../../../circos-0.62-1/bin/circos -conf circos.conf</p>
<p>shenzy@shenzy-ubuntu:/winxp_disk2/shenzy/circos/circos-tutorials-0.62/tutorials/8/1$ ll<br />
total 16001<br />
-rwxrwxrwx 1 root root 528519 2012-12-27 16:11 circos1.png<br />
-rwxrwxrwx 1 root root 1065925 2012-12-27 14:33 circos2.png<br />
-rwxrwxrwx 1 root root 1230394 2013-01-15 14:04 circosBAV1_VS11a_genome.jpg<br />
-rwxrwxrwx 1 root root 4654 2013-01-15 13:11 circos.conf<br />
-rwxrwxrwx 1 root root 1119210 2013-01-15 13:36 circos.jpg<br />
-rwxrwxrwx 1 root root 794858 2013-01-15 13:15 circos.png<br />
-rwxrwxrwx 1 root root 2900177 2013-01-15 13:15 circos.svg<br />
-rwxrwxrwx 1 root root 4242152 2013-01-15 13:32 circos.xcf<br />
-rwxrwxrwx 1 root root 4242152 2013-01-15 13:30 circos.xcf.png<br />
-rwxrwxrwx 1 root root 128 2013-01-14 15:01 highlights.conf<br />
-rwxrwxrwx 1 root root 2075 2013-01-14 14:42 highlights.conf.org<br />
-rwxrwxrwx 1 root root 892 2013-01-15 13:15 ideogram.conf<br />
-rwxrwxrwx 1 root root 35132 2011-03-22 00:40 image-01.png<br />
-rwxrwxrwx 1 root root 30883 2011-03-22 00:40 image-02.png<br />
-rwxrwxrwx 1 root root 126878 2011-03-22 00:40 image-03.png<br />
-rwxrwxrwx 1 root root 1305 2012-05-05 00:25 ticks.conf</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/circos.conf_.txt">circos.conf</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/ideogram.conf_.txt">ideogram.conf</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/Dehalococcoidessp.BAV1_.gene_table2.txt">Dehalococcoidessp.BAV1.gene_table2</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/Dehalococcoidessp.11a.gene_tableBAV1.txt">Dehalococcoidessp.11a.gene_tableBAV1</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/Dehalococcoidessp.GTcds_.gene_tableBAV1.txt">Dehalococcoidessp.GTcds.gene_tableBAV1</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/karyotype.microbe.txt">karyotype.microbe</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/Dehalococcoidessp.CBDB1cds.gene_tableBAV1.txt">Dehalococcoidessp.CBDB1cds.gene_tableBAV1</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/Dehalococcoidessp.195cds.gene_tableBAV1.txt">Dehalococcoidessp.195cds.gene_tableBAV1</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/circosBAV1_VS11a_genome.jpg"><img class="aligncenter size-large wp-image-706" title="circosBAV1_VS11a_genome" src="http://www.biofacebook.com/wp-content/uploads/2013/01/circosBAV1_VS11a_genome-1024x1024.jpg" alt="" width="640" height="640" /></a></p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=697</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to measure codon usage bias</title>
		<link>https://www.biofacebook.com/?p=694</link>
		<comments>https://www.biofacebook.com/?p=694#comments</comments>
		<pubDate>Tue, 15 Jan 2013 07:53:16 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[生物信息]]></category>
		<category><![CDATA[bioinformatics]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=694</guid>
		<description><![CDATA[<p>Codon adaptation index (CAI) is one of them. To examine the CAI value of a gene, a reference table of RSCU (relative synonymous codon usage) values for highly expressed genes is compiled.</p> <p>A software call CodonW, you can download it from: http://codonw.sourceforge.net/. There is also a PhD thesis associated to it.</p> <p>shenzy@shenzy-ubuntu:~/Downloads/CondonW/codonW$ codonw input.dat -all_indices [...]]]></description>
				<content:encoded><![CDATA[<p>Codon adaptation index (CAI) is one of them. To examine the CAI value of a gene, a reference table of RSCU (relative synonymous codon usage) values for highly expressed genes is compiled.</p>
<p>A software call CodonW, you can download it from: <a href="http://www.researchgate.net/go.Deref.html?url=http%3A%2F%2Fcodonw.sourceforge.net%2F" target="_blank">http://codonw.sourceforge.net/</a>. There is also a PhD thesis associated to it.</p>
<p>shenzy@shenzy-ubuntu:~/Downloads/CondonW/codonW$ codonw input.dat -all_indices -c_type 2 -f_type 4 -nomenu</p>
<p>eg:</p>
<p>shenzy@shenzy-ubuntu:/winxp_disk2/shenzy/circos/work$ codonw Dehalococcoidessp.BAV1.cds.fasta.dat -all_indices -c_type 2 -f_type 4 -nomenu</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2013/01/aa.png"><img class="aligncenter size-large wp-image-695" title="aa" src="http://www.biofacebook.com/wp-content/uploads/2013/01/aa-1024x176.png" alt="" width="640" height="110" /></a></p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=694</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Using prcomp/princomp for PCA in R （二）</title>
		<link>https://www.biofacebook.com/?p=590</link>
		<comments>https://www.biofacebook.com/?p=590#comments</comments>
		<pubDate>Fri, 31 Aug 2012 04:08:34 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[R语言]]></category>
		<category><![CDATA[统计学习]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=590</guid>
		<description><![CDATA[<p>###############################</p> <p>PCA ############################### install.packages(&#8220;vegan&#8221;) library(vegan)</p> <p>&#62; STpcoa&#60;-read.table(file=&#8221;bactera_16s_final.subsample.phylip.tre1.weighted.phylip.pcoa.axes&#8221;, header=T,row.names=1) &#62; STpcoa axis1 axis2 axis3 axis4 Cellulose -0.020878 -0.234601 0.167454 0 Foodwaste -0.234592 0.221741 0.085802 0 Sludge 0.368882 0.100725 -0.010570 0 Xylan -0.113413 -0.087865 -0.242686 0 &#62;pl.STpcoa&#60;-princomp(STpcoa) &#62; summary(pl.STpcoa) Importance of components: Comp.1 Comp.2 Comp.3 Comp.4 Standard deviation 0.2260563 0.1746944 0.1536319 0 Proportion of Variance 0.4856521 0.2900347 [...]]]></description>
				<content:encoded><![CDATA[<p>###############################</p>
<p>PCA<br />
###############################<br />
install.packages(&#8220;vegan&#8221;)<br />
library(vegan)</p>
<p>&gt; STpcoa&lt;-read.table(file=&#8221;bactera_16s_final.subsample.phylip.tre1.weighted.phylip.pcoa.axes&#8221;, header=T,row.names=1)<br />
&gt; STpcoa<br />
axis1 axis2 axis3 axis4<br />
Cellulose -0.020878 -0.234601 0.167454 0<br />
Foodwaste -0.234592 0.221741 0.085802 0<br />
Sludge 0.368882 0.100725 -0.010570 0<br />
Xylan -0.113413 -0.087865 -0.242686 0<br />
&gt;pl.STpcoa&lt;-princomp(STpcoa)<br />
&gt; summary(pl.STpcoa)<br />
Importance of components:<br />
Comp.1 Comp.2 Comp.3 Comp.4<br />
Standard deviation 0.2260563 0.1746944 0.1536319 0<br />
Proportion of Variance 0.4856521 0.2900347 0.2243133 0<br />
Cumulative Proportion 0.4856521 0.7756867 1.0000000 1</p>
<p>&gt; ls(pl.STpcoa)<br />
[1] &#8220;call&#8221; &#8220;center&#8221; &#8220;loadings&#8221; &#8220;n.obs&#8221; &#8220;scale&#8221; &#8220;scores&#8221; &#8220;sdev&#8221;<br />
&gt; class(pl.STpcoa)<br />
[1] &#8220;princomp&#8221;<br />
&gt; nmds.col&lt;-c(rep(&#8220;green&#8221;, 1), rep(&#8220;blue&#8221;, 1), rep(&#8220;black&#8221;,1), rep(&#8220;red&#8221;,1))<br />
&gt; plot(pl.STpcoa$scores, col=nmds.col, pch=20)<br />
&gt; legend(x=0.12, y=0.25, c(&#8220;Cellulose&#8221;,&#8221;Foodwaste&#8221;,&#8221;Sludge&#8221;,&#8221;Xylan&#8221;),c(&#8220;green&#8221;,&#8221;blue&#8221;,&#8221;black&#8221;,&#8221;red&#8221;),bty=&#8221;n&#8221;)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/pca-comp-8-31.png"><img class="alignleft size-full wp-image-591" title="pca-comp-8-31" src="http://www.biofacebook.com/wp-content/uploads/2012/08/pca-comp-8-31.png" alt="" width="750" height="400" /></a><br />
&gt; biplot(pl.STpcoa)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/bioCOMP.jpeg"><img class="alignleft size-full wp-image-592" title="bioCOMP" src="http://www.biofacebook.com/wp-content/uploads/2012/08/bioCOMP.jpeg" alt="" width="750" height="400" /></a><br />
##############################<br />
&gt; pl2.STpcoa&lt;-prcomp(STpcoa)<br />
&gt; class(pl2.STpcoa)<br />
[1] &#8220;prcomp&#8221;<br />
&gt; pl2.STpcoa$sd^2<br />
[1] 0.06813525 0.04069083 0.03147035 0.00000000<br />
&gt; summary(pl2.STpcoa)<br />
Importance of components:<br />
PC1 PC2 PC3 PC4<br />
Standard deviation 0.2610 0.2017 0.1774 0<br />
Proportion of Variance 0.4857 0.2900 0.2243 0<br />
Cumulative Proportion 0.4857 0.7757 1.0000 1<br />
&gt; pl2.STpcoa$x<br />
PC1 PC2 PC3 PC4<br />
Cellulose -0.02087762 -0.2346017 0.16745306 0<br />
Foodwaste -0.23459165 0.2217407 0.08580311 0<br />
Sludge 0.36888225 0.1007250 -0.01056992 0<br />
Xylan -0.11341297 -0.0878640 -0.24268626 0<br />
&gt; pl2.STpcoa$rotation<br />
PC1 PC2 PC3 PC4<br />
axis1 1.000000e+00 -9.352971e-08 -8.835157e-07 0<br />
axis2 9.353330e-08 1.000000e+00 4.064575e-06 0<br />
axis3 8.835153e-07 -4.064576e-06 1.000000e+00 0<br />
axis4 0.000000e+00 0.000000e+00 0.000000e+00 1<br />
&gt; plot(pl2.STpcoa$x, col=nmds.col, pch=20)<br />
&gt; legend(x=0.12, y=0.25, c(&#8220;Cellulose&#8221;,&#8221;Foodwaste&#8221;,&#8221;Sludge&#8221;,&#8221;Xylan&#8221;),c(&#8220;green&#8221;,&#8221;blue&#8221;,&#8221;black&#8221;,&#8221;red&#8221;),bty=&#8221;n&#8221;)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/pca-pc-8-31.png"><img class="alignleft size-full wp-image-593" title="pca-pc-8-31" src="http://www.biofacebook.com/wp-content/uploads/2012/08/pca-pc-8-31.png" alt="" width="750" height="400" /></a><br />
&gt; screeplot(pl2.STpcoa,type=&#8221;lines&#8221;,main=&#8221;Scree Plot&#8221;)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/Scree-Plot-PC-8-31.jpeg"><img class="alignleft size-full wp-image-594" title="Scree-Plot-PC-8-31" src="http://www.biofacebook.com/wp-content/uploads/2012/08/Scree-Plot-PC-8-31.jpeg" alt="" width="750" height="400" /></a><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/bactera_pcoa.axes_.txt">bactera_16s_final.subsample.phylip.tre1.weighted.phylip.pcoa.axes</a></p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=590</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Using prcomp/princomp for PCA in R （一）</title>
		<link>https://www.biofacebook.com/?p=587</link>
		<comments>https://www.biofacebook.com/?p=587#comments</comments>
		<pubDate>Fri, 31 Aug 2012 04:00:07 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[R语言]]></category>
		<category><![CDATA[统计学习]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=587</guid>
		<description><![CDATA[<p>Difference between prcomp and princomp:</p> <p>&#8216;princomp&#8217; can only be used with more units than variables&#8221;</p> <p>prcomp是基于SVD分解（svd()函数，princomp是基于特征向量eigen()函数)</p> <p>Good video source:</p> <p>http://www.youtube.com/watch?v=oZ2nfIPdvjY</p> <p>http://www.youtube.com/watch?v=I5GxNzKLIoU&#038;feature=relmfu</p> <p>http://www.planta.cn/forum/viewtopic.php?t=16754&#038;highlight=%D3%EF%D1%D4</p> <p>###########################################</p> <p>以下所有代码包括练习数据，都可在R平台上直接运行。</p> <p>#主成分分析和主成分回归 主成分分析的思想是Pearson 1901年提出的，Hotelling 1933进一步发展 在R中，进行主成分分析用到princomp() 函数</p> <p>用法 princomp(x, cor = FALSE, scores = TRUE, covmat = NULL, subset = rep(TRUE, nrow(as.matrix(x))), &#8230;)</p> <p># 分析用数据 # cor 是否用样本的协方差矩阵作主成分分析 prcomp() 二 summary()函数 三 [...]]]></description>
				<content:encoded><![CDATA[<p>Difference between prcomp and princomp:</p>
<p>&#8216;princomp&#8217; can only be used with more units than variables&#8221;</p>
<p>prcomp是基于SVD分解（svd()函数，princomp是基于特征向量eigen()函数)</p>
<p>Good video source:</p>
<p>http://www.youtube.com/watch?v=oZ2nfIPdvjY</p>
<p>http://www.youtube.com/watch?v=I5GxNzKLIoU&#038;feature=relmfu</p>
<p>http://www.planta.cn/forum/viewtopic.php?t=16754&#038;highlight=%D3%EF%D1%D4</p>
<p>###########################################</p>
<p>以下所有代码包括练习数据，都可在R平台上直接运行。</p>
<p>#主成分分析和主成分回归<br />
主成分分析的思想是Pearson 1901年提出的，Hotelling 1933进一步发展<br />
在R中，进行主成分分析用到princomp() 函数</p>
<p>用法<br />
princomp(x, cor = FALSE, scores = TRUE, covmat = NULL,<br />
subset = rep(TRUE, nrow(as.matrix(x))), &#8230;)</p>
<p># 分析用数据<br />
# cor 是否用样本的协方差矩阵作主成分分析<br />
prcomp()<br />
二 summary()函数<br />
三 loadings()函数<br />
四 predict() 函数<br />
五 screeplot() 函数<br />
六 biplot() 函数</p>
<p>实例<br />
某中学随机抽取某年级30名学生，测量其身高，体重，胸围，坐高，针对这30名中学生身体四项指标数据做主成分分析。<br />
student&lt;-data.frame(<br />
X1=c(148, 139, 160, 149, 159, 142, 153, 150, 151, 139,<br />
140, 161, 158, 140, 137, 152, 149, 145, 160, 156,<br />
151, 147, 157, 147, 157, 151, 144, 141, 139, 148 ),<br />
X2=c(41, 34, 49, 36, 45, 31, 43, 43, 42, 31,<br />
29, 47, 49, 33, 31, 35, 47, 35, 47, 44,<br />
42, 38, 39, 30, 48, 36, 36, 30, 32, 38 ),<br />
X3=c(72, 71, 77, 67, 80, 66, 76, 77, 77, 68,<br />
64, 78, 78, 67, 66, 73, 82, 70, 74, 78,<br />
73, 73, 68, 65, 80, 74, 68, 67, 68, 70),<br />
X4=c(78, 76, 86, 79, 86, 76, 83, 79, 80, 74,<br />
74, 84, 83, 77, 73, 79, 79, 77, 87, 85,<br />
82, 78, 80, 75, 88, 80, 76, 76, 73, 78 )<br />
)<br />
#主成分分析<br />
student.pr &lt;- princomp(student, cor = TRUE)<br />
#显示结果<br />
summary(student.pr, loadings=TRUE)<br />
#预测，显示各样本主成分的值<br />
pre&lt;-predict(student.pr)<br />
#显示碎石图<br />
screeplot(student.pr,type=&#8221;lines&#8221;)<br />
# 主成分分析散点图<br />
biplot(student.pr)</p>
<p>例二<br />
对128个成年男子的身材进行测量，每人测得16项指标，身高，坐高，胸围，头高，裤长，下档，手长，领围，前胸，后背，肩厚，肩宽，袖长，肋围，腰围，腿肚，分别用X1-X16表示。16项指标的相关矩阵R。从相关矩阵出发进行主成分分析，随16项指标进行分类。<br />
命令<br />
x&lt;-c(<br />
1.00,<br />
0.79, 1.00,<br />
0.36, 0.31, 1.00,<br />
0.96, 0.74, 0.38, 1.00,<br />
0.89, 0.58, 0.31, 0.90, 1.00,<br />
0.79, 0.58, 0.30, 0.78, 0.79, 1.00,<br />
0.76, 0.55, 0.35, 0.75, 0.74, 0.73, 1.00,<br />
0.26, 0.19, 0.58, 0.25, 0.25, 0.18, 0.24, 1.00,<br />
0.21, 0.07, 0.28, 0.20, 0.18, 0.18, 0.29,-0.04, 1.00,<br />
0.26, 0.16, 0.33, 0.22, 0.23, 0.23, 0.25, 0.49,-0.34, 1.00,<br />
0.07, 0.21, 0.38, 0.08,-0.02, 0.00, 0.10, 0.44,-0.16, 0.23, 1.00,<br />
0.52, 0.41, 0.35, 0.53, 0.48, 0.38, 0.44, 0.30,-0.05, 0.50, 0.24, 1.00,<br />
0.77, 0.47, 0.41, 0.79, 0.79, 0.69, 0.67, 0.32, 0.23, 0.31, 0.10, 0.62, 1.00,<br />
0.25, 0.17, 0.64, 0.27, 0.27, 0.14, 0.16, 0.51, 0.21, 0.15, 0.31, 0.17, 0.26, 1.00,<br />
0.51, 0.35, 0.58, 0.57, 0.51, 0.26, 0.38, 0.51, 0.15, 0.29, 0.28, 0.41, 0.50, 0.63, 1.00,<br />
0.21, 0.16, 0.51, 0.26, 0.23, 0.00, 0.12, 0.38, 0.18, 0.14, 0.31, 0.18, 0.24, 0.50, 0.65, 1.00<br />
)<br />
names&lt;-c(&#8220;X1&#8243;, &#8220;X2&#8243;, &#8220;X3&#8243;, &#8220;X4&#8243;, &#8220;X5&#8243;, &#8220;X6&#8243;, &#8220;X7&#8243;, &#8220;X8&#8243;, &#8220;X9&#8243;,<br />
&#8220;X10&#8243;, &#8220;X11&#8243;, &#8220;X12&#8243;, &#8220;X13&#8243;, &#8220;X14&#8243;, &#8220;X15&#8243;, &#8220;X16&#8243;)<br />
R&lt;-matrix(0, nrow=16, ncol=16, dimnames=list(names, names))<br />
for (i in 1:16){<br />
for (j in 1:i){<br />
R&lt;-x[(i-1)*i/2+j]; R[j,i]&lt;-R<br />
}<br />
}<br />
#主成分分析<br />
pr&lt;-princomp(covmat=R)<br />
load&lt;-loadings(pr)</p>
<p>#<br />
plot(load[,1:2])<br />
text(load[,1], load[,2], adj=c(-0.4, 0.3))</p>
<p>主成分回归<br />
考虑进口总额Y与三个自变量：国内总产值，存储量，总消费量之间的关系。现收集了1949-1959共11年的数据，试做线性回归和主成分回归分析。<br />
conomy&lt;-data.frame(<br />
x1=c(149.3, 161.2, 171.5, 175.5, 180.8, 190.7, 202.1, 212.4, 226.1, 231.9, 239.0),<br />
x2=c(4.2, 4.1, 3.1, 3.1, 1.1, 2.2, 2.1, 5.6, 5.0, 5.1, 0.7),<br />
x3=c(108.1, 114.8, 123.2, 126.9, 132.1, 137.7, 146.0, 154.1, 162.3, 164.3, 167.6),<br />
y=c(15.9, 16.4, 19.0, 19.1, 18.8, 20.4, 22.7, 26.5, 28.1, 27.6, 26.3)<br />
)</p>
<p>线性回归<br />
lm.sol&lt;-lm(y~x1+x2+x3, data=conomy)<br />
summary(lm.sol)<br />
主成分回归</p>
<p># 主成分分析<br />
conomy.pr&lt;-princomp(~x1+x2+x3, data=conomy, cor=T)<br />
summary(conomy.pr, loadings=TRUE)<br />
pre&lt;-predict(conomy.pr)<br />
conomy$z1&lt;-pre[,1]; conomy$z2&lt;-pre[,2]<br />
lm.sol&lt;-lm(y~z1+z2, data=conomy)<br />
summary(lm.sol)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/pca_135.jpeg"><img class="alignleft size-full wp-image-588" title="pca_135" src="http://www.biofacebook.com/wp-content/uploads/2012/08/pca_135.jpeg" alt="" width="558" height="557" /></a></p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=587</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>4sample CA RDA analysis</title>
		<link>https://www.biofacebook.com/?p=559</link>
		<comments>https://www.biofacebook.com/?p=559#comments</comments>
		<pubDate>Thu, 30 Aug 2012 04:14:47 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[R语言]]></category>
		<category><![CDATA[统计学习]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=559</guid>
		<description><![CDATA[<p>&#160;</p> <p>&#62; gtsdata_test=read.table(&#8220;gtsdata.txt&#8221;, header=T) &#62; gtsenv=read.table(&#8220;gtsenv.txt&#8221;, header=T) &#62; gtsdata_data_t&#60;-t(gtsdata_data) &#62; decorana(gtsdata_data_t)</p> <p>Call: decorana(veg = gtsdata_data_t)</p> <p>Detrended correspondence analysis with 26 segments. Rescaling of axes with 4 iterations.</p> <p>DCA1 DCA2 DCA3 DCA4 Eigenvalues 0.8634 0.4834 0.23788 0 Decorana values 0.8721 0.3793 0.07223 0 Axis lengths 5.3292 2.1115 1.80907 0</p> <p>&#62; gts.ca=cca(gtsdata_data_t) &#62; gts.ca Call: cca(X = [...]]]></description>
				<content:encoded><![CDATA[<p>&nbsp;</p>
<p>&gt; gtsdata_test=read.table(&#8220;gtsdata.txt&#8221;, header=T)<br />
&gt; gtsenv=read.table(&#8220;gtsenv.txt&#8221;, header=T)<br />
&gt; gtsdata_data_t&lt;-t(gtsdata_data)<br />
&gt; decorana(gtsdata_data_t)</p>
<p>Call:<br />
decorana(veg = gtsdata_data_t)</p>
<p>Detrended correspondence analysis with 26 segments.<br />
Rescaling of axes with 4 iterations.</p>
<p>DCA1 DCA2 DCA3 DCA4<br />
Eigenvalues 0.8634 0.4834 0.23788 0<br />
Decorana values 0.8721 0.3793 0.07223 0<br />
Axis lengths 5.3292 2.1115 1.80907 0</p>
<p>&gt; gts.ca=cca(gtsdata_data_t)<br />
&gt; gts.ca<br />
Call: cca(X = gtsdata_data_t)</p>
<p>Inertia Rank<br />
Total 1.653<br />
Unconstrained 1.653 3<br />
Inertia is mean squared contingency coefficient</p>
<p>Eigenvalues for unconstrained axes:<br />
CA1 CA2 CA3<br />
0.8721 0.5037 0.2776</p>
<p>&gt; plot(gts.ca,scaling=3)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/4sampleCA-analysis.png"><img class="alignleft size-full wp-image-560" title="4sampleCA-analysis" src="http://www.biofacebook.com/wp-content/uploads/2012/08/4sampleCA-analysis.png" alt="" width="750" height="400" /></a></p>
<p>&gt; gtsdata_data_t_del&lt;-gtsdata_data_t[1:3,]<br />
&gt; gtsdata_data_t_del</p>
<p>&gt; gts.rda=rda(gtsdata_data_t_del,gtsenv)<br />
&gt; gts.rda<br />
Call: rda(X = gtsdata_data_t_del, Y = gtsenv)</p>
<p>Inertia Proportion Rank<br />
Total 101790 1<br />
Constrained 101790 1 2<br />
Unconstrained 0 0 0<br />
Inertia is variance<br />
Some constraints were aliased because they were collinear (redundant)</p>
<p>Eigenvalues for constrained axes:<br />
RDA1 RDA2<br />
81240 20549</p>
<p>plot(gts.rda,display=c(&#8220;sp&#8221;,&#8221;bp&#8221;,&#8221;si&#8221;),scaling=3)</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/3sampleRDA-analysis.png"><img class="alignleft size-full wp-image-561" title="3sampleRDA-analysis" src="http://www.biofacebook.com/wp-content/uploads/2012/08/3sampleRDA-analysis.png" alt="" width="750" height="400" /></a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/gtsenv2.txt">gtsenv.txt</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/4sample_R_cluster.csv">gtsdata.txt</a></p>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=559</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>基于Vegan 软件包的生态学数据排序分析学习</title>
		<link>https://www.biofacebook.com/?p=531</link>
		<comments>https://www.biofacebook.com/?p=531#comments</comments>
		<pubDate>Tue, 28 Aug 2012 04:40:20 +0000</pubDate>
		<dc:creator><![CDATA[szypanther]]></dc:creator>
				<category><![CDATA[My project]]></category>
		<category><![CDATA[R语言]]></category>
		<category><![CDATA[统计学习]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://www.biofacebook.com/?p=531</guid>
		<description><![CDATA[<p>“基于Vegan 软件包的生态学数据排序分析 赖江山 米湘成 (中国科学院植物研究所植被与环境变化国家重点实验室，北京 100093) 摘要：群落学数据一般是多维数据，例如物种属性或环境因子的属性。多元统计分析是群落生态学常用的分析方法，排序（ordination）是多元统计最常用的方法之一。CANOCO是广泛使用的排序软件，但缺点是商业软件价格不菲，版本更新速度也很慢。近年来，R语言以其灵活、开放、易于掌握、免费等诸多优点，在生态学和生物多样性研究领域迅速赢得广大研究人员的青睐。R语言中的外在软件包“Vegan”是专门用于群落生态学分析的工具。Vegan能够提供所有基本的排序方法，同时具有生成精美排序图的功能，版本更新很快。我们认为Vegan包完全可以取代CANOCO，成为今后排序分析的首选统计工具。本文首先简述排序的原理和类型，然后介绍Vegan的基本信息和下载安装过程，最后以古田山24公顷样地内随机抽取40个20m×20m的样方为例，展示Vegan包内各种常用排序方法（PCA,RDA,CA和CCA）和排序图生成过程，希望能为R的初学者尽快熟悉并利用Vegan包进行排序分析提供参考。</p> <p>gtsdata</p> <p>gtsenv.txt</p> <p>赖江山.pdf</p> &#62; setwd("/winxp_disk2/shenzy/R/Vegan") &#62; gtsdata=read.table("gtsdata.txt", header=T) &#62; gtsenv=read.table("gtsenv.txt", header=T) &#62; install.packages("vegan") Installing package(s) into ‘/home/shenzy/R/x86_64-pc-linux-gnu-library/2.15’ (as ‘lib’ is unspecified) 试开URL’http://cran.csiro.au/src/contrib/vegan_2.0-4.tar.gz' Content type 'application/x-gzip' length 1576584 bytes (1.5 Mb) 打开了URL ================================================== downloaded 1.5 Mb * installing *source* package ‘vegan’ ... ** 成功将‘vegan’程序包解包并MD5和检查 ** libs gfortran -fpic -O3 [...]]]></description>
				<content:encoded><![CDATA[<p>“基于Vegan 软件包的生态学数据排序分析<br />
赖江山 米湘成<br />
(中国科学院植物研究所植被与环境变化国家重点实验室，北京 100093)<br />
摘要：群落学数据一般是多维数据，例如物种属性或环境因子的属性。多元统计分析是群落生态学常用的分析方法，排序（ordination）是多元统计最常用的方法之一。CANOCO是广泛使用的排序软件，但缺点是商业软件价格不菲，版本更新速度也很慢。近年来，R语言以其灵活、开放、易于掌握、免费等诸多优点，在生态学和生物多样性研究领域迅速赢得广大研究人员的青睐。R语言中的外在软件包“Vegan”是专门用于群落生态学分析的工具。Vegan能够提供所有基本的排序方法，同时具有生成精美排序图的功能，版本更新很快。我们认为Vegan包完全可以取代CANOCO，成为今后排序分析的首选统计工具。本文首先简述排序的原理和类型，然后介绍Vegan的基本信息和下载安装过程，最后以古田山24公顷样地内随机抽取40个20m×20m的样方为例，展示Vegan包内各种常用排序方法（PCA,RDA,CA和CCA）和排序图生成过程，希望能为R的初学者尽快熟悉并利用Vegan包进行排序分析提供参考。</p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/gtsdata.txt">gtsdata</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/gtsenv1.txt">gtsenv.txt</a></p>
<p><a href="http://www.biofacebook.com/wp-content/uploads/2012/08/赖江山.pdf">赖江山.pdf</a></p>
<pre>&gt; setwd("/winxp_disk2/shenzy/R/Vegan")
&gt; gtsdata=read.table("gtsdata.txt", header=T)
&gt; gtsenv=read.table("gtsenv.txt", header=T)
&gt; install.packages("vegan")
Installing package(s) into ‘/home/shenzy/R/x86_64-pc-linux-gnu-library/2.15’
(as ‘lib’ is unspecified)
试开URL’http://cran.csiro.au/src/contrib/vegan_2.0-4.tar.gz'
Content type 'application/x-gzip' length 1576584 bytes (1.5 Mb)
打开了URL
==================================================
downloaded 1.5 Mb

* installing *source* package ‘vegan’ ...
** 成功将‘vegan’程序包解包并MD5和检查
** libs
gfortran   -fpic  -O3 -pipe  -g  -c cepin.f -o cepin.o
gcc -std=gnu99 -I/usr/share/R/include -DNDEBUG      -fpic  -O3 -pipe  -g  -c data2hill.c -o data2hill.o
gfortran   -fpic  -O3 -pipe  -g  -c decorana.f -o decorana.o
gcc -std=gnu99 -I/usr/share/R/include -DNDEBUG      -fpic  -O3 -pipe  -g  -c goffactor.c -o goffactor.o
gfortran   -fpic  -O3 -pipe  -g  -c monoMDS.f -o monoMDS.o
gcc -std=gnu99 -I/usr/share/R/include -DNDEBUG      -fpic  -O3 -pipe  -g  -c nestedness.c -o nestedness.o
gfortran   -fpic  -O3 -pipe  -g  -c ordering.f -o ordering.o
gcc -std=gnu99 -I/usr/share/R/include -DNDEBUG      -fpic  -O3 -pipe  -g  -c pnpoly.c -o pnpoly.o
gcc -std=gnu99 -I/usr/share/R/include -DNDEBUG      -fpic  -O3 -pipe  -g  -c stepacross.c -o stepacross.o
gcc -std=gnu99 -I/usr/share/R/include -DNDEBUG      -fpic  -O3 -pipe  -g  -c vegdist.c -o vegdist.o
gcc -std=gnu99 -shared -o vegan.so cepin.o data2hill.o decorana.o goffactor.o monoMDS.o nestedness.o ordering.o pnpoly.o stepacross.o vegdist.o -lgfortran -lm -lquadmath -L/usr/lib/R/lib -lR
安装至 /home/shenzy/R/x86_64-pc-linux-gnu-library/2.15/vegan/libs
** R
** data
** inst
** preparing package for lazy loading
** help
*** installing help indices
** building package indices
** installing vignettes
   ‘decision-vegan.Rnw’
   ‘diversity-vegan.Rnw’ using ‘UTF-8’
   ‘intro-vegan.Rnw’ using ‘UTF-8’
** testing if installed package can be loaded

* DONE (vegan)

The downloaded source packages are in
	‘/tmp/RtmpmtXtEK/downloaded_packages’
&gt; library(vegan)
载入需要的程辑包：permute

载入程辑包：‘permute’

The following object(s) are masked from ‘package:gtools’:

    permute

This is vegan 2.0-4
&gt; decorana(gtsdata)

Call:
decorana(veg = gtsdata) 

Detrended correspondence analysis with 26 segments.
Rescaling of axes with 4 iterations.

                  DCA1   DCA2    DCA3    DCA4
Eigenvalues     0.3939 0.2239 0.09555 0.06226
Decorana values 0.5025 0.1756 0.06712 0.03877
Axis lengths    3.2595 2.5130 1.21445 1.00854

&gt; gts.pca=rda(gtsdata)
&gt; gts.pca
Call: rda(X = gtsdata)

              Inertia Rank
Total           352.1
Unconstrained   352.1   22
Inertia is variance 

Eigenvalues for unconstrained axes:
    PC1     PC2     PC3     PC4     PC5     PC6     PC7     PC8
111.779  73.580  54.607  32.959  26.481  18.063  12.763   7.637
(Showed only 8 of all 22 unconstrained eigenvalues)

Note: 通过以上命令选择排序模型（线性模型PCA、RDA或单峰模型CA、CCA），因为Axis lengths 等同于CANOCO中的DCA分析，
DCA排序数值最大max&gt;4选单峰，&lt;3 选线性模型， 3&lt;max&lt;4 则都二者都行，这里3.2595属于此情况！总的特征根为352.1，即物种分布
的总变化量。PCA排序为每个抽所能解释的方差变化量。对于第一抽而言，111.779/352.1=31.7% 为其物种分布的解释量。</pre>
<pre>&gt; plot(gts.pca)
<a href="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot0.png"><img class="alignleft size-full wp-image-538" title="Rplot0" src="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot0.png" alt="" width="750" height="400" /></a></pre>
<p>&nbsp;</p>
<pre> &gt;biplot(gts.pca)
<a href="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot1.png"><img class="alignleft size-full wp-image-539" title="Rplot1" src="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot1.png" alt="" width="750" height="400" /></a></pre>
<p>因以上重叠现象严重，原因是物种分布差异打，分布不均匀的物种占据了大部分排序空间，可对物种数据进行单位方差标准化。通过scale参数实现，如下：</p>
<pre>&gt; gts.pca=rda(gtsdata, scale=T)
&gt; biplot(gts.pca,scaling=3)
<a href="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot2.png"><img class="alignleft size-full wp-image-542" title="Rplot2" src="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot2.png" alt="" width="750" height="400" /></a></pre>
<pre>Note:scaling=1 关注物种间关系</pre>
<pre>scaling=2 关注样方之间关系
scaling=3 关注样方与物种之间关系</pre>
<pre>&gt; biplot(gts.pca,display="sp")</pre>
<pre>&gt; biplot(gts.pca,display="si")</pre>
<pre>&gt; biplot(gts.pca,display="sp", choices=c(1,3))
<a href="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot3.png"><img class="alignleft size-full wp-image-544" title="Rplot3" src="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot3.png" alt="" width="750" height="400" /></a></pre>
<pre></pre>
<p>CA分析：</p>
<pre>&gt; gts.ca=cca(gtsdata)
&gt; gts.ca
Call: cca(X = gtsdata)

              Inertia Rank
Total           1.424
Unconstrained   1.424   21
Inertia is mean squared contingency coefficient 

Eigenvalues for unconstrained axes:
    CA1     CA2     CA3     CA4     CA5     CA6     CA7     CA8
0.50253 0.26564 0.14023 0.10502 0.09127 0.05540 0.05063 0.04204
(Showed only 8 of all 21 unconstrained eigenvalues)</pre>
<pre>&gt; plot(gts.ca,scaling=3)
<a href="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot4.png"><img class="alignleft size-full wp-image-546" title="Rplot4" src="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot4.png" alt="" width="750" height="400" /></a></pre>
<pre>从CA解读即：如某一个物种靠近某个样方，表明该物种可能对样方位置起很大作用。从图可以看出20号样方与短柄饱（QUESER）很近。同时19与20号样方距离近，表明物种组结构特征也近！而只有少数样方出现的物种，如CASCAR，通常在排序空间边缘，表明只偶然发生。该列对应样方数值都很小或0！对在排序中心的物种，可能在取样区域是其最优分布。对应该列（CASERY）数值较大而多！</pre>
<pre>
RDA分析（多个矩阵分析）：</pre>
<pre>&gt; gts.rda=rda(gtsdata,gtsenv)
&gt; gts.rda
Call: rda(X = gtsdata, Y = gtsenv)

               Inertia Proportion Rank
Total         352.0917     1.0000
Constrained   137.4026     0.3902    8
Unconstrained 214.6891     0.6098   22
Inertia is variance 

Eigenvalues for constrained axes:
   RDA1    RDA2    RDA3    RDA4    RDA5    RDA6    RDA7    RDA8
56.3864 42.7769 17.8270 13.5066  2.5020  2.1217  1.6616  0.6203 

Eigenvalues for unconstrained axes:
   PC1    PC2    PC3    PC4    PC5    PC6    PC7    PC8
72.287 54.891 26.618 17.959 12.730  9.918  5.659  5.349
(Showed only 8 of all 22 unconstrained eigenvalues)</pre>
<pre>plot(gts.rda,display=c("sp","bp","si"),scaling=3)
<a href="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot6.png"><img class="alignleft size-full wp-image-548" title="Rplot6" src="http://www.biofacebook.com/wp-content/uploads/2012/08/Rplot6.png" alt="" width="750" height="400" /></a></pre>
<pre>在RDA排序中，箭头连线长度代表某个</pre>
<pre>环境因子与群落分布和种类分布间相关
程度的大小，越长相关性越大。
箭头连线和排序抽的夹角代表某个环境因子
与排序抽的相关性大小，越小相关性越大！</pre>
<pre>&gt; gts.prda=rda(gtsdata,gtsenv[,1:4], gtsenv[,5:8])
&gt; gts.prda
Call: rda(X = gtsdata, Y = gtsenv[, 1:4], Z = gtsenv[, 5:8])

               Inertia Proportion Rank
Total         352.0917     1.0000
Conditional    95.0318     0.2699    4
Constrained    42.3708     0.1203    4
Unconstrained 214.6891     0.6098   22
Inertia is variance 

Eigenvalues for constrained axes:
  RDA1   RDA2   RDA3   RDA4
27.522  9.087  3.442  2.320 

Eigenvalues for unconstrained axes:
   PC1    PC2    PC3    PC4    PC5    PC6    PC7    PC8
72.287 54.891 26.618 17.959 12.730  9.918  5.659  5.349
(Showed only 8 of all 22 unconstrained eigenvalues)
Note: gtsenv[,1:4]表示环境矩阵只取前4列，即地形因子。Constrained为42.37除以
352.09=12.03%，表示地形因子单独所能解释的特征根占总特征根的百分比。Y，Z调换下，
可得土壤因子单独的解释量，2者总共的解释量前面已经算出，即为39.02%。所以2组环境
变量共同的解释量为39.02%-15.53%-12.03%=11.46%!

CCA分析类似
&gt;gts.cca=cca(gtsdata,gtsenv)</pre>
<pre></pre>
]]></content:encoded>
			<wfw:commentRss>https://www.biofacebook.com/?feed=rss2&#038;p=531</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
