Converting between circRNA ID types in ten lines of code
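Research hot topics never stop coming. On the technology side, miRNA, lncRNA, circRNA, and ceRNA each had a year or two in the spotlight, and now it is the turn of m6A and single-cell. We have already put together a systematic summary of circRNA background knowledge at 生信技能树. This post takes a circRNA microarray expression matrix, maps its probe IDs to circRNA names through the platform annotation, and then maps those names to their hsa_circ_ aliases.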
> a[1:4,1:4]
ID_REF GSM2561829 GSM2561830 GSM2561831
1 ASCRP000002 9.042573 9.238902 8.997313
2 ASCRP000004 10.219584 9.999965 10.246754
3 ASCRP000005 5.997230 6.022147 6.075589
4 ASCRP000006 7.918213 7.954153 8.005365
ID circRNA SPOT_ID PROBE_TYPE BUILD SEQUENCE
15 AS_circRNA_control9 control control TGTCAGTCTGCAGCTACTCAGATCAACCTCTCCACTTCTCCTACAC
16 ASCRP000001 hsa_circRNA_000006 hsa_circRNA_000006 circRNA HG19 TGGGCTTGAGGCCTGATCTTTTGGCCAGAAGGAGATTAAAAAGATG
17 ASCRP000002 hsa_circRNA_000010 hsa_circRNA_000010 circRNA HG19 TCCTTTTGGCCTCACCCAATGACCTGGCTGAAGAAGAGCCCAAGGA
18 ASCRP000003 hsa_circRNA_000028 hsa_circRNA_000028 circRNA HG19 TAAGCCAAATGACTAACAGTAATTAAAATGGAAATGGCACAGGGAG
19 ASCRP000004 hsa_circRNA_000031 hsa_circRNA_000031 circRNA HG19 ATTGGCACTCAGTGACCATCAGGCTGGCTGTGTGCGGCAGCTTCCT
20 ASCRP000005 hsa_circRNA_000041 hsa_circRNA_000041 circRNA HG19 GCAGGTTGAGGATTTTATTTGATCCCTGCTCTAATTTTTAGCTTCA
GPL21825:074301 Arraystar Human CircRNA microarray V2
GPL19978:Agilent-069978 Arraystar Human CircRNA microarray V1
GPL26925:Agilent-084217 CapitalBio Technology Human CircRNA Array v2
GPL23467:Agilent-082557 CBChuman circRNA array V2.0
> head(e)
circRNA Alias
1 hsa_circRNA_000001 hsa_circ_0000001
2 hsa_circRNA_100003 hsa_circ_0000002
3 hsa_circRNA_100011 hsa_circ_0000007
4 hsa_circRNA_100017 hsa_circ_0000008
5 hsa_circRNA_000031 hsa_circ_0000009
6 hsa_circRNA_092361 hsa_circ_0000010
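library(data.table)  # provides fread()
# Expression matrix: one row per probe, one column per GSM sample
a <- fread('probeMatrix.txt', data.table = FALSE)
a[1:4, 1:4]
# Platform annotation: probe ID -> circRNA name
b <- read.table('ann.txt', sep = '\t', header = TRUE)
tail(head(b, 20))
# Attach the circRNA name to each probe row
d <- merge(a, b, by.x = 'ID_REF', by.y = 'ID')
# Lookup table: circRNA name -> hsa_circ_ alias
e <- read.table('ID.txt', header = TRUE)
head(e)
# Attach the alias; merge() is an inner join by default, so names missing from e are dropped
f <- merge(e, d, by = 'circRNA')
head(f[, 1:6])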
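Because both merge() calls are inner joins, any probe whose circRNA name is missing from the alias table silently disappears from f. The sketch below replays the two merges on a few rows copied from the printed output above, so it runs without the input files. The toy tables and the names f_inner and f_left are illustrative assumptions, not part of the original script. The toy alias table holds only two rows from head(e), so a missing alias here says nothing about the full ID.txt.

```r
# Toy inputs: a few rows copied from the printed output above (not the full files)
a <- data.frame(
  ID_REF = c("ASCRP000002", "ASCRP000004", "ASCRP000005"),
  GSM2561829 = c(9.042573, 10.219584, 5.997230),
  stringsAsFactors = FALSE
)
b <- data.frame(
  ID = c("ASCRP000002", "ASCRP000004", "ASCRP000005"),
  circRNA = c("hsa_circRNA_000010", "hsa_circRNA_000031", "hsa_circRNA_000041"),
  stringsAsFactors = FALSE
)
e <- data.frame(
  circRNA = c("hsa_circRNA_000001", "hsa_circRNA_000031"),
  Alias = c("hsa_circ_0000001", "hsa_circ_0000009"),
  stringsAsFactors = FALSE
)

# Step 1: probe ID -> circRNA name
d <- merge(a, b, by.x = "ID_REF", by.y = "ID")

# Step 2a: inner join, as in the script above; unmatched probes are dropped
f_inner <- merge(e, d, by = "circRNA")

# Step 2b: left join; every probe is kept and Alias is NA when there is no match
f_left <- merge(d, e, by = "circRNA", all.x = TRUE)

print(f_inner)
print(f_left)
cat("probes without an alias:", sum(is.na(f_left$Alias)), "\n")
```

Comparing nrow(d) with nrow(f) on the real data is a quick way to see how many probes the alias table fails to cover. Use all.x = TRUE if those rows should stay in the result.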