尔需要收拾整顿一个表格文献记载对于应的样原疑息以下,但是那些疑息散布正在差别的表格需要兼并:meta.tsvIDSRRGSMPtStatusCombinedBamBai3219_SR_1 SRR17839312GSM5851565Pt1PrePt1Pre ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839312/3219_SR_1possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839312/3219_SR_1possorted_genome_bam.bam.1.bai3219_SR_2 SRR17839311GSM5851566Pt1PostPt1Post ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839311/3219_SR_2possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839311/3219_SR_2possorted_genome_bam.bam.1.bai3219_SR_3 SRR17839310GSM5851567Pt2PrePt2Pre ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839310/3219_SR_3possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839310/3219_SR_3possorted_genome_bam.bam.1.bai3219_SR_4 SRR17839309GSM5851568Pt2PostPt2Post ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839309/3219_SR_4possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839309/3219_SR_4possorted_genome_bam.bam.1.bai3219_SR_5 SRR17839308GSM5851569Pt3PrePt3Pre ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839308/3219_SR_5possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839308/3219_SR_5possorted_genome_bam.bam.1.bai3219_SR_6 SRR17839307GSM5851570Pt3PostPt3Post ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839307/3219_SR_6possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839307/3219_SR_6possorted_genome_bam.bam.1.bai3521_SR_1 SRR17839306GSM5851571Pt4PrePt4Pre ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839306/3521_SR_1possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839306/3521_SR_1possorted_genome_bam.bam.1.bai3521_SR_2 SRR17839305GSM5851572Pt4PostPt4Post ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839305/3521_SR_2possorted_genome_bam.bam.1 ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839305/3521_SR_2possorted_genome_bam.bam.1.bai
表格疑息滥觞以下:
ena 搜刮勾选需要的数据 数据下载路子:https://www.ebi.ac.uk/ena/browser/view/PRJNA802247
获得下载链交表格1:wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839311/3219_SR_2possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839309/3219_SR_4possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839309/3219_SR_4possorted_genome_bam.bam.1wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839307/3219_SR_6possorted_genome_bam.bam.1wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839312/3219_SR_1possorted_genome_bam.bam.1wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839305/3521_SR_2possorted_genome_bam.bam.1wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839312/3219_SR_1possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839306/3521_SR_1possorted_genome_bam.bam.1wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839308/3219_SR_5possorted_genome_bam.bam.1wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839310/3219_SR_3possorted_genome_bam.bam.1wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839308/3219_SR_5possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839306/3521_SR_1possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839310/3219_SR_3possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839307/3219_SR_6possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839305/3521_SR_2possorted_genome_bam.bam.1.baiwget -nc ftp://ftp.sra.ebi.ac.uk/vol1/run/SRR178/SRR17839311/3219_SR_2possorted_genome_bam.bam.1
那些疑息散布正在差别的表格中,咱们需要收拾整顿一下,那里能够颠末AI模子助咱们处置一下:
表格2 GSM号战样原ID疑疑GSM5851565 ScRNA Pt1 PreGSM5851566 ScRNA Pt1 PostGSM5851567 ScRNA Pt2 PreGSM5851568 ScRNA Pt2 PostGSM5851569 ScRNA Pt3 PreGSM5851570 ScRNA Pt3 PostGSM5851571 ScRNA Pt4 PreGSM5851572 ScRNA Pt4 Post
表格3 数据SRR号战GSM的疑息:
Run Library Name tissue tissue_type time_pointSRR17839305 GSM5851572 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma postSRR17839306 GSM5851571 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma preSRR17839307 GSM5851570 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma postSRR17839308 GSM5851569 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma preSRR17839309 GSM5851568 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma postSRR17839310 GSM5851567 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma preSRR17839311 GSM5851566 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma postSRR17839312 GSM5851565 Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma pre
使用AI干表格兼并:
那里用的是豆包速率会快一点儿,deepseek也能够完毕可是比力卡,如下是提醒词汇及完毕历程:
终极成果以下:
文献名 Library Name Description Tissue Tissue Type Time Point3219_SR_4 GSM5851568 ScRNA Pt2 Post Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma post3219_SR_6 GSM5851570 ScRNA Pt3 Post Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma post3219_SR_1 GSM5851565 ScRNA Pt1 Pre Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma pre3521_SR_2 GSM5851572 ScRNA Pt4 Post Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma post3521_SR_1 GSM5851571 ScRNA Pt4 Pre Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma pre3219_SR_5 GSM5851569 ScRNA Pt3 Pre Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma pre3219_SR_3 GSM5851567 ScRNA Pt2 Pre Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma pre3219_SR_2 GSM5851566 ScRNA Pt1 Post Tumor Head And Neck Oral Cavity Squamous Cell Carcinoma post
各人能够翻开链交寓目对于话实质:https://www.doubao.com/thread/w882cbc4f971511f0