<u id="vqgcl"><tbody id="vqgcl"></tbody></u>

<blockquote id="vqgcl"><form id="vqgcl"></form></blockquote>

溫馨提示×

溫馨提示×

您好，登錄后才能下訂單哦！

密碼登錄×

忘記密碼？

登錄注冊(cè)×

獲取短信驗(yàn)證碼

其他方式登錄

點(diǎn)擊登錄注冊(cè) 即表示同意《億速云用戶服務(wù)條款》

用戶登錄×

賬戶密碼登錄

請(qǐng)使用微信掃描上方二維碼

使用幫助

請(qǐng)求超時(shí)！

請(qǐng)點(diǎn)擊重新獲取二維碼

可以做structure的R語言包LEA是怎樣的

發(fā)布時(shí)間：2021-11-22 09:43:19 來源：億速云閱讀：178 作者：柒染欄目：大數(shù)據(jù)

這期內(nèi)容當(dāng)中小編將會(huì)給大家?guī)碛嘘P(guān)可以做structure的R語言包LEA是怎樣的，文章內(nèi)容豐富且以專業(yè)的角度為大家分析和敘述，閱讀完這篇文章希望大家可以有所收獲。

關(guān)于分群的軟件，之前寫了structure 2.3.4 軟件使用指南，軟件雖然有windows版本，但是操作太麻煩了，也寫了Admixture使用說明文檔cookbook，但是只有Linux版本，使用起來有難度。難道不能使用R語言進(jìn)行structure繪圖么？結(jié)果來了：LEA！

1. paper

LEA: An R package for landscape and ecological association studies

使用說明文檔

不同格式的數(shù)據(jù)使用LEA

2. 軟件介紹

This short tutorial explains how population structure analyses reproducing the results of the widely-used computer program structure can be performed using commands in the R language. The method works for any operating systems, and it does not require the installation
of structure or additional computer programs. The R program allows running population structure inference algorithms, choosing the number of clusters, and showing admixture coefficient bar-plots using a few commands. The methods used by R are fast and accurate, and they
are free of standard population genetic equilibrium hypotheses. In addition, these methods allow their users to play with a large panel of graphical functions for displaying pie-charts and interpolated admixture coefficients on geographic maps.

劃重點(diǎn):

可以在R語言中實(shí)現(xiàn)軟件Structure的功能
可以做類似admixture的圖
簡(jiǎn)單操作, 幾個(gè)命令實(shí)現(xiàn)相關(guān)功能
C語言開發(fā), 可以處理大數(shù)據(jù)

3. 軟件安裝

install.packages(c("fields","RColorBrewer","mapplots"))
source("http://bioconductor.org/biocLite.R")
biocLite("LEA")

如果安裝不成功, 也可以通過CRAN把軟件包下載到本地, 進(jìn)行安裝:

install.packages("LEA_1.4.0_tar.gz", repos = NULL, type ="source")

載入兩個(gè)函數(shù), 進(jìn)行格式轉(zhuǎn)化以及可視化:

source("http://membres-timc.imag.fr/Olivier.Francois/Conversion.R")
source("http://membres-timc.imag.fr/Olivier.Francois/POPSutilities.R")

4. 測(cè)試數(shù)據(jù)

plink格式的ped文件, 具體格式參考:plink格式的ped和map文件及轉(zhuǎn)化為012的方法

1 SAMPLE0 0 0 2 2 1 2 3 3 1 1 2 1
2 SAMPLE1 0 0 1 2 2 1 1 3 0 4 1 1
3 SAMPLE2 0 0 2 1 2 2 3 3 1 4 1 1

前六列為:
家系ID
個(gè)體ID
父本
母本
性別
表型值
SNP1-1(SNP1的第一個(gè)位點(diǎn))
SNP1-2(SNP的第二個(gè)位點(diǎn))

測(cè)試數(shù)據(jù)采用admixture的示例數(shù)據(jù), 使用plink將其轉(zhuǎn)化為ped文件

library(LEA)
# 結(jié)果會(huì)生成test.geno文件的數(shù)據(jù).
output = ped2lfmm("test.ped")

# 使用LEA進(jìn)行structure進(jìn)行分析
library(LEA)
obj.snmf = snmf("test.geno", K = 3, alpha = 100, project = "new")
qmatrix = Q(obj.snmf, K = 3)
head(qmatrix)
barplot(t(qmatrix), col = rainbow(3), border = NA, space = 0,
        xlab = "Individuals", ylab = "Admixture coefficients")

可以做structure的R語言包LEA是怎樣的

對(duì)比admixture的結(jié)果

# 對(duì)比admixture結(jié)果
qad = read.table("test.3.Q")
head(qad)
barplot(t(qad), col = rainbow(3), border = NA, space = 0,
        xlab = "Individuals", ylab = "Admixture coefficients")

可以做structure的R語言包LEA是怎樣的

5. 使用`snmf`選擇最優(yōu)K值

# 繪制折線圖, 選擇最優(yōu)K值.
plot(project, col = "blue", pch = 19, cex = 1.2)

可以做structure的R語言包LEA是怎樣的
可以看出, K=3時(shí), 最小, 因此選擇K=3.

上述就是小編為大家分享的可以做structure的R語言包LEA是怎樣的了，如果剛好有類似的疑惑，不妨參照上述分析進(jìn)行理解。如果想知道更多相關(guān)知識(shí)，歡迎關(guān)注億速云行業(yè)資訊頻道。

向AI問一下細(xì)節(jié)

推薦閱讀：

免責(zé)聲明：本站發(fā)布的內(nèi)容（圖片、視頻和文字）以原創(chuàng)、轉(zhuǎn)載和分享為主，文章觀點(diǎn)不代表本網(wǎng)站立場(chǎng)，如果涉及侵權(quán)請(qǐng)聯(lián)系站長(zhǎng)郵箱：is@yisu.com進(jìn)行舉報(bào)，并提供相關(guān)證據(jù)，一經(jīng)查實(shí)，將立刻刪除涉嫌侵權(quán)內(nèi)容。

上一篇新聞：
JAVA中spring配置文件出現(xiàn)錯(cuò)誤提示Class 'org.apache.commons.dbcp.BasicDataSource' not found怎么辦
下一篇新聞：
c語言怎么實(shí)現(xiàn)含遞歸清場(chǎng)版掃雷游戲

猜你喜歡

AI
助
手

產(chǎn)品服務(wù)

地區(qū)劃分

專題活動(dòng)

幫助支持

關(guān)于我們

售后咨詢

7*24小時(shí)在線電話：400-100-2938

7*24小時(shí)在線 QQ：800811969

關(guān)注億速云

億速云公眾號(hào)

手機(jī)網(wǎng)站二維碼