Tutorials

Prerequisites

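This guide is written for beginners and introduces the basic features of pdb-profiling. Before reading, please make sure you are familiar with the following:

  • What the PDB is
  • What UniProtKB is
  • What alternative splicing is
  • What protein primary structure (sequence) is
  • What protein tertiary structure (crystal structure) is
  • What protein-protein interaction (PPI) is
  • Basic Python programming
  • Installing Python packages with pip
  • The basics of the pandas DataFrame
No knowledge of Python asynchronous programming is required for this part.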

Installation

pdb-profiling is a Python 3 package. It requires Python 3.6 or later to be installed on your computer.

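Before installing

  • Make sure that a 64-bit build of Python is installed on your 64-bit computer.
  • To avoid unexpected errors, first upgrade pip by running the following command: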
pip install --upgrade pip
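The two requirements above (a 64-bit interpreter, Python 3.6 or later) can be checked from Python itself. The snippet below is a minimal sketch that uses only the standard library; the helper names python_bitness and meets_requirements are illustrative and are not part of pdb-profiling.

```python
import struct
import sys


def python_bitness() -> int:
    """Return the pointer size of the running interpreter in bits (32 or 64)."""
    return struct.calcsize("P") * 8


def meets_requirements(min_version=(3, 6)) -> bool:
    """True if the interpreter is 64-bit and at least the given version."""
    return python_bitness() == 64 and sys.version_info[:2] >= min_version


if __name__ == "__main__":
    major, minor = sys.version_info[:2]
    print(f"Python {major}.{minor}, {python_bitness()}-bit")
    print("OK" if meets_requirements() else "Please install a 64-bit Python 3.6+")
```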

Installation steps

Run the following pip command in a terminal to install pdb-profiling:

pip install pdb-profiling

The required dependencies are installed along with it.

If you have already installed an earlier version of pdb-profiling, run the following command to upgrade it:

pip install --upgrade pdb-profiling

That completes the installation.

Initializing the output folder

from pdb_profiling import default_config

default_config(your_output_folder)

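Calling default_config initializes the relevant settings. For finer-grained configuration, see Section 5: Reference.

The your_output_folder variable is the output folder (working directory) of your choice. If no argument is passed, the current directory is used.

After default_config has run, a number of subfolders and database files are created in the working directory. Please do not delete them.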
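Because the working directory accumulates files that should not be deleted, it can help to choose a dedicated folder. The following is a minimal sketch, assuming default_config accepts a path-like value as in the call shown earlier; prepare_output_folder and the folder name are hypothetical and not part of pdb-profiling.

```python
from pathlib import Path


def prepare_output_folder(path) -> Path:
    """Create the folder (and any missing parents) and return its absolute path."""
    folder = Path(path).expanduser().resolve()
    folder.mkdir(parents=True, exist_ok=True)
    return folder


if __name__ == "__main__":
    # Hypothetical location; pick any folder you want to keep long-term.
    your_output_folder = prepare_output_folder("./pdb_profiling_output")
    try:
        from pdb_profiling import default_config
    except ImportError:
        print("pdb-profiling is not installed; skipping default_config")
    else:
        # Assumption: a pathlib.Path is accepted here.
        default_config(your_output_folder)
```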

Calling the PDBe RESTful API

Taking PDB entry 3hl2 as an example:

Import the required class/function

from pdb_profiling.processors import PDB

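Retrieve the results in tabular format

The example below fetches the information at https://www.ebi.ac.uk/pdbe/api/pdb/entry/molecules/3hl2, which gives Entity-Chain level metadata for every molecule in the PDB entry, including the molecule type, molecule name, and SEQRES sequence: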

pdb_object = PDB('3hl2')

dfrm = pdb_object.fetch_from_pdbe_api('api/pdb/entry/molecules/', PDB.to_dataframe).result()
index | ca_p_only | entity_id | gene_name | in_chains | in_struct_asyms | length | molecule_name | molecule_type | mutation_flag | number_of_copies | pdb_sequence | pdb_sequence_indices_with_multiple_residues | sample_preparation | sequence | source | synonym | weight | pdb_id
0 | False | 1 | ["SEPSECS","TRNP48"] | ["A","B","C","D"] | ["A","B","C","D"] | 501.0 | ["O-phosphoseryl-tRNA(Sec) selenium transferase"] | polypeptide(L) | NaN | 4 | MNRESFAAGERLVSPAYVRQGCEARRSHEHLIRLLLEKGKCPENGW... | {} | Genetically manipulated | MNRESFAAGERLVSPAYVRQGCEARRSHEHLIRLLLEKGKCPENGW... | [{"expression_host_scientific_name":"Escherich... | O-phosphoseryl-tRNA(Sec) selenium transferase | 55801.211 | 3hl2
1 | False | 2 | NaN | ["E"] | ["E"] | 90.0 | ["tRNASec"] | polyribonucleotide | NaN | 1 | GCCCGGAUGAUCCUCAGUGGUCUGGGGUGCAGGCUUCAAACCUGUA... | {} | Genetically manipulated | GCCCGGAUGAUCCUCAGUGGUCUGGGGUGCAGGCUUCAAACCUGUA... | [{"expression_host_scientific_name":"Escherich... | tRNASec | 28908.084 | 3hl2
2 | False | 3 | NaN | ["A","B","C","D"] | ["F","I","L","N"] | NaN | ["(5-HYDROXY-4,6-DIMETHYLPYRIDIN-3-YL)METHYL D... | bound | NaN | 4 | NaN | NaN | Synthetically obtained | NaN | NaN | NaN | 233.158 | 3hl2
3 | False | 4 | NaN | ["A","C"] | ["G","H","M"] | NaN | ["Monothiophosphate"] | bound | NaN | 3 | NaN | NaN | Synthetically obtained | NaN | NaN | NaN | 114.061 | 3hl2
4 | False | 5 | NaN | ["B","D"] | ["J","K","O","P"] | NaN | ["PHOSPHOSERINE"] | bound | NaN | 4 | NaN | NaN | Synthetically obtained | NaN | NaN | NaN | 185.072 | 3hl2
5 | False | 6 | NaN | ["A","B","C","D","E"] | ["Q","R","S","T","U"] | NaN | ["water"] | water | NaN | 288 | NaN | NaN | Natural source | NaN | NaN | NaN | 18.015 | 3hl2
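Several columns in this output, such as gene_name, in_chains, and molecule_name, appear to hold JSON-encoded strings rather than Python lists. The sketch below shows one way to decode such cells with the standard library, assuming they are indeed valid JSON; parse_json_cell is an illustrative helper, not a pdb-profiling function.

```python
import json

# Two cells copied from the table above (entity_id -> in_chains).
in_chains_cells = {
    1: '["A","B","C","D"]',
    2: '["E"]',
}


def parse_json_cell(cell):
    """Decode a JSON-encoded cell; pass through non-string values such as NaN."""
    return json.loads(cell) if isinstance(cell, str) else cell


entity_to_chains = {eid: parse_json_cell(cell) for eid, cell in in_chains_cells.items()}
print(entity_to_chains)  # {1: ['A', 'B', 'C', 'D'], 2: ['E']}
```

On a real DataFrame the same helper could be applied column-wise, for example with dfrm['in_chains'].apply(parse_json_cell).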

Alternatively, pass no further arguments to get the path of the downloaded file directly:

pdb_object.fetch_from_pdbe_api('api/pdb/entry/molecules/').result()
# >>> WindowsPath('./api/pdb/entry/molecules/api%pdb%entry%molecules%+3hl2.tsv')
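The api_suffix string used in these calls mirrors the path of the PDBe endpoint: as the molecules example shows, the suffix 'api/pdb/entry/molecules/' combined with the identifier 3hl2 corresponds to the URL quoted earlier. The following sketch rebuilds that URL by plain concatenation. build_pdbe_url and PDBE_BASE_URL are illustrative names, and the library may construct its requests differently.

```python
PDBE_BASE_URL = "https://www.ebi.ac.uk/pdbe/"


def build_pdbe_url(api_suffix: str, identifier: str) -> str:
    """Join the base URL, an API suffix (with trailing slash) and an identifier."""
    return f"{PDBE_BASE_URL}{api_suffix}{identifier}"


print(build_pdbe_url("api/pdb/entry/molecules/", "3hl2"))
# https://www.ebi.ac.uk/pdbe/api/pdb/entry/molecules/3hl2
```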

The full set of available APIs can be viewed with the following code:

from pdb_profiling.processors.pdbe.record import API_SET
print(API_SET)
{"api/mappings/",
 "api/mappings/all_isoforms/",
 "api/mappings/best_structures/",
 "api/mappings/cath/",
 "api/mappings/cath_b/",
 "api/mappings/ec/",
 "api/mappings/ensembl/",
 "api/mappings/go/",
 "api/mappings/hmmer/",
 "api/mappings/homologene/",
 "api/mappings/homologene_uniref90/",
 "api/mappings/interpro/",
 "api/mappings/isoforms/",
 "api/mappings/pfam/",
 "api/mappings/scop/",
 "api/mappings/sequence_domains/",
 "api/mappings/structural_domains/",
 "api/mappings/uniprot/",
 "api/mappings/uniprot_publications/",
 "api/mappings/uniprot_segments/",
 "api/mappings/uniprot_to_pfam/",
 "api/mappings/uniref90/",
 "api/pdb/entry/assembly/",
 "api/pdb/entry/binding_sites/",
 "api/pdb/entry/carbohydrate_polymer/",
 "api/pdb/entry/cofactor/",
 "api/pdb/entry/drugbank/",
 "api/pdb/entry/electron_density_statistics/",
 "api/pdb/entry/experiment/",
 "api/pdb/entry/files/",
 "api/pdb/entry/ligand_monomers/",
 "api/pdb/entry/modified_AA_or_NA/",
 "api/pdb/entry/molecules/",
 "api/pdb/entry/mutated_AA_or_NA/",
 "api/pdb/entry/observed_residues_ratio/",
 "api/pdb/entry/polymer_coverage/",
 "api/pdb/entry/related_experiment_data/",
 "api/pdb/entry/residue_listing/",
 "api/pdb/entry/secondary_structure/",
 "api/pdb/entry/status/",
 "api/pdb/entry/summary/",
 "api/pisa/interfacedetail/",
 "api/pisa/interfacelist/",
 "graph-api/compound/atoms/",
 "graph-api/compound/bonds/",
 "graph-api/compound/cofactors/",
 "graph-api/compound/summary/",
 "graph-api/mappings/",
 "graph-api/mappings/all_isoforms/",
 "graph-api/mappings/best_structures/",
 "graph-api/mappings/ensembl/",
 "graph-api/mappings/homologene/",
 "graph-api/mappings/isoforms/",
 "graph-api/mappings/sequence_domains/",
 "graph-api/mappings/uniprot/",
 "graph-api/mappings/uniprot_segments/",
 "graph-api/pdb/bound_excluding_branched/",
 "graph-api/pdb/bound_molecules/",
 "graph-api/pdb/funpdbe/",
 "graph-api/pdb/funpdbe_annotation/",
 "graph-api/pdb/funpdbe_annotation/14-3-3-pred/",
 "graph-api/pdb/funpdbe_annotation/3Dcomplex/",
 "graph-api/pdb/funpdbe_annotation/3dligandsite/",
 "graph-api/pdb/funpdbe_annotation/ChannelsDB/",
 "graph-api/pdb/funpdbe_annotation/FoldX/",
 "graph-api/pdb/funpdbe_annotation/M-CSA/",
 "graph-api/pdb/funpdbe_annotation/MetalPDB/",
 "graph-api/pdb/funpdbe_annotation/Missense3D/",
 "graph-api/pdb/funpdbe_annotation/POPScomp_PDBML/",
 "graph-api/pdb/funpdbe_annotation/ProKinO/",
 "graph-api/pdb/funpdbe_annotation/akid/",
 "graph-api/pdb/funpdbe_annotation/camkinet/",
 "graph-api/pdb/funpdbe_annotation/canSAR/",
 "graph-api/pdb/funpdbe_annotation/cath-funsites/",
 "graph-api/pdb/funpdbe_annotation/depth/",
 "graph-api/pdb/funpdbe_annotation/dynamine/",
 "graph-api/pdb/funpdbe_annotation/p2rank/",
 "graph-api/pdb/ligand_monomers/",
 "graph-api/pdb/modified_AA_or_NA/",
 "graph-api/pdb/mutated_AA_or_NA/",
 "graph-api/pdb/secondary_structure/",
 "graph-api/pdb/sequence_conservation/",
 "graph-api/residue_mapping/"}
Using an API that does not meet the requirements or is not yet supported raises an exception.

Batch retrieval
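To fail early in your own scripts, a suffix can be checked against the set before any request is made. This is a minimal sketch that uses a three-item excerpt of the listing above instead of importing API_SET; check_api_suffix is a hypothetical helper, and ValueError is my choice here, not necessarily the exception type the library raises.

```python
# A small excerpt of API_SET, copied from the listing above.
API_SET_EXCERPT = frozenset({
    "api/pdb/entry/molecules/",
    "api/pdb/entry/summary/",
    "api/mappings/all_isoforms/",
})


def check_api_suffix(api_suffix, api_set=API_SET_EXCERPT):
    """Return the suffix unchanged if it is supported, otherwise raise ValueError."""
    if api_suffix not in api_set:
        raise ValueError(f"Unsupported API suffix: {api_suffix!r}")
    return api_suffix


print(check_api_suffix("api/pdb/entry/summary/"))  # api/pdb/entry/summary/
```

Note that every entry in the set ends with a slash, so 'api/pdb/entry/summary' without the trailing slash would be rejected by this check.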

from pdb_profiling.processors import PDBs

pdb_objects = PDBs(['1a01', '2xyn', '3hl2', '4hho'])

This yields a PDBs collection object that contains multiple PDB objects:

(<PDB 1a01>, <PDB 2xyn>, <PDB 3hl2>, <PDB 4hho>)

Calling the relevant function directly on this PDBs object retrieves the results for all of its PDB entries in batch:

res = pdb_objects.fetch(
    'fetch_from_pdbe_api', 
    api_suffix='api/pdb/entry/summary/', 
    then_func=PDB.to_dataframe).run().result()

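The returned res is a list in which every element is a pandas.DataFrame. pandas.concat can be used to combine them into a single DataFrame.

Note: the order of the results in res is not guaranteed to follow the order of the PDB entries in PDBs.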
from pandas import concat
concat(res, sort=False, ignore_index=True)
index | assemblies | deposition_date | deposition_site | entry_authors | experimental_method | experimental_method_class | number_of_entities | pdb_id | processing_site | related_structures | release_date | revision_date | split_entry | title
0 | [{"assembly_id":"1","form":"Non-polymer only",... | 19971208 | NaN | ["Kavanaugh, J.S.","Arnone, A."] | ["X-ray diffraction"] | ["x-ray"] | {"polypeptide":2,"dna":0,"ligand":1,"dna/rna":... | 1a01 | BNL | [] | 19980318 | 20110713 | [] | HEMOGLOBIN (VAL BETA1 MET, TRP BETA37 ALA) MUTANT
1 | [{"preferred":true,"form":"homo","name":"monom... | 20101118 | PDBE | ["Salah, E.","Ugochukwu, E.","Elkins, J.M.","B... | ["X-ray diffraction"] | ["x-ray"] | {"water":1,"polypeptide":1,"other":0,"dna":0,"... | 2xyn | PDBE | [] | 20101201 | 20190403 | [] | HUMAN ABL2 IN COMPLEX WITH AURORA KINASE INHIB...
2 | [{"preferred":true,"form":"hetero","name":"pen... | 20090526 | RCSB | ["Palioura, S.","Steitz, T.A.","Soll, D.","Sim... | ["X-ray diffraction"] | ["x-ray"] | {"water":1,"polypeptide":1,"other":0,"dna":0,"... | 3hl2 | RCSB | [] | 20091006 | 20180124 | [] | The crystal structure of the human SepSecS-tRN...
3 | [{"assembly_id":"1","form":"homo","preferred":... | 20121010 | RCSB | ["Moshe, B.-D.","Grzegorz, W.","Mikael, E.","I... | ["X-ray diffraction"] | ["x-ray"] | {"polypeptide":1,"dna":0,"ligand":3,"dna/rna":... | 4hho | RCSB | [] | 20130327 | 20130327 | [] | Serum paraoxonase-1 by directed evolution with...
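Since the batch results may arrive in a different order than the input, downstream code that relies on the input order needs to restore it explicitly. The sketch below illustrates the idea with plain dictionaries standing in for the per-entry results; reorder_by_input is an illustrative helper and not part of pdb-profiling.

```python
def reorder_by_input(results, input_ids, key="pdb_id"):
    """Return the results sorted to follow the order of input_ids."""
    position = {pdb_id: i for i, pdb_id in enumerate(input_ids)}
    return sorted(results, key=lambda record: position[record[key]])


input_ids = ["1a01", "2xyn", "3hl2", "4hho"]
# Stand-in records arriving in completion order rather than input order.
unordered = [{"pdb_id": "4hho"}, {"pdb_id": "1a01"}, {"pdb_id": "3hl2"}, {"pdb_id": "2xyn"}]

ordered = reorder_by_input(unordered, input_ids)
print([record["pdb_id"] for record in ordered])  # ['1a01', '2xyn', '3hl2', '4hho']
```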

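Built-in batch retrieval functions

The PDBs class also ships with several predefined retrieval pipelines.

Retrieve information about the experimental method: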

from pandas import DataFrame

exp_res = pdb_objects.fetch(PDBs.fetch_exp_pipe).run().result()

DataFrame(
    (j for i in exp_res for j in i),
    columns=['pdb_id',
             'resolution',
             'experimental_method_class',
             'experimental_method',
             'multi_method'])
index | pdb_id | resolution | experimental_method_class | experimental_method | multi_method
0 | 4hho | 2.10 | x-ray | X-ray diffraction | False
1 | 1a01 | 1.80 | x-ray | X-ray diffraction | False
2 | 3hl2 | 2.81 | x-ray | X-ray diffraction | False
3 | 2xyn | 2.81 | x-ray | X-ray diffraction | False
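The generator expression (j for i in exp_res for j in i) in the call above flattens a list of per-entry row collections into a single stream of rows. The sketch below shows the equivalent itertools.chain.from_iterable form on stand-in data. The shape of exp_res used here (one list of row tuples per PDB entry) is an assumption inferred from that expression.

```python
from itertools import chain

# Assumed shape: one list of row tuples per PDB entry (values copied from the table above).
exp_res = [
    [("4hho", 2.10, "x-ray", "X-ray diffraction", False)],
    [("1a01", 1.80, "x-ray", "X-ray diffraction", False)],
]

rows_from_generator = list(j for i in exp_res for j in i)
rows_from_chain = list(chain.from_iterable(exp_res))

# Both spellings yield the same flat list of rows.
print(rows_from_generator == rows_from_chain)  # True
print(len(rows_from_chain))  # 2
```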

Retrieve date information:

date_res = pdb_objects.fetch(PDBs.fetch_date).run().result()
DataFrame(date_res, columns=['pdb_id', 'revision_date', 'deposition_date'])
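The dates in the summary output shown earlier are written as YYYYMMDD values. Assuming the date columns returned here use the same format, they can be converted to datetime.date objects for comparison or arithmetic. parse_pdbe_date is an illustrative helper, not a pdb-profiling function.

```python
from datetime import date, datetime


def parse_pdbe_date(value) -> date:
    """Parse a YYYYMMDD value (string or integer) into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Dates of PDB entry 1a01, copied from the summary output earlier in this tutorial.
deposition = parse_pdbe_date("19971208")
revision = parse_pdbe_date(20110713)

print(deposition.isoformat(), revision.isoformat())  # 1997-12-08 2011-07-13
print(revision > deposition)  # True
```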

Calling the SIFTS API

from pdb_profiling.processors import SIFTS, SIFTSs

sifts_from_pdb_demo = SIFTS('1a01')    # <SIFTS PDB Entry 1a01>
sifts_from_unp_demo = SIFTS('Q5VST9')  # <SIFTS UniProt Q5VST9>

The SIFTS class automatically detects the type of the identifier it is given and raises an error if the identifier is not valid.
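To illustrate what such a detection step could look like, the sketch below classifies an identifier with two regular expressions: one for classic four-character PDB IDs and one based on the accession pattern published by UniProt, extended with an optional isoform suffix such as -2. This is not the library's actual implementation. classify_identifier and both patterns are assumptions made for illustration.

```python
import re

# Classic PDB IDs: a digit followed by three alphanumeric characters.
PDB_ID = re.compile(r"[0-9][A-Za-z0-9]{3}")
# UniProt accession pattern, plus an optional isoform suffix like "-2".
UNIPROT_AC = re.compile(
    r"(?:[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2})"
    r"(?:-[0-9]+)?"
)


def classify_identifier(identifier: str) -> str:
    """Return 'PDB Entry' or 'UniProt', or raise ValueError for anything else."""
    if PDB_ID.fullmatch(identifier):
        return "PDB Entry"
    if UNIPROT_AC.fullmatch(identifier):
        return "UniProt"
    raise ValueError(f"Unrecognized identifier: {identifier!r}")


for example in ("1a01", "Q5VST9", "P21359-2"):
    print(example, "->", classify_identifier(example))
```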

From PDB to UniProt Isoform

The following shows how to use the SIFTS API to retrieve all known UniProt isoform mappings for every protein chain of the target PDB entry:

sifts_from_pdb_demo.fetch_from_pdbe_api('api/mappings/all_isoforms/', SIFTS.to_dataframe).result()
index | UniProt | chain_id | end | entity_id | identifier | identity | is_canonical | name | pdb_end | pdb_id | pdb_start | start | struct_asym_id | unp_end | unp_start
0 | P68871 | B | {"author_residue_number":146,"author_insertion... | 2 | HBB_HUMAN | 0.99 | True | HBB_HUMAN | 146 | 1a01 | 1 | {"author_residue_number":1,"author_insertion_c... | B | 147 | 2
1 | P68871 | D | {"author_residue_number":146,"author_insertion... | 2 | HBB_HUMAN | 0.99 | True | HBB_HUMAN | 146 | 1a01 | 1 | {"author_residue_number":1,"author_insertion_c... | D | 147 | 2
2 | P69905 | A | {"author_residue_number":141,"author_insertion... | 1 | HBA_HUMAN | 1.00 | True | HBA_HUMAN | 141 | 1a01 | 1 | {"author_residue_number":1,"author_insertion_c... | A | 142 | 2
3 | P69905 | C | {"author_residue_number":141,"author_insertion... | 1 | HBA_HUMAN | 1.00 | True | HBA_HUMAN | 141 | 1a01 | 1 | {"author_residue_number":1,"author_insertion_c... | C | 142 | 2
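A common next step with this output is to group the chains by the UniProt entry they map to. The sketch below does this with the standard library on the (UniProt, chain_id) pairs of the table above. With a real DataFrame, a pandas groupby on the UniProt column would serve the same purpose.

```python
from collections import defaultdict

# (UniProt, chain_id) pairs copied from the table above for PDB entry 1a01.
rows = [("P68871", "B"), ("P68871", "D"), ("P69905", "A"), ("P69905", "C")]

chains_by_uniprot = defaultdict(list)
for accession, chain_id in rows:
    chains_by_uniprot[accession].append(chain_id)

print(dict(chains_by_uniprot))  # {'P68871': ['B', 'D'], 'P69905': ['A', 'C']}
```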

From UniProt Isoform to PDB

Retrieving all PDB chains that map to a target UniProt isoform through the SIFTS API works the same way:

sifts_from_unp_demo.fetch_from_pdbe_api('api/mappings/all_isoforms/', SIFTS.to_dataframe).result()
index | UniProt | chain_id | end | entity_id | identity | is_canonical | pdb_id | start | struct_asym_id | unp_end | unp_start
0 | Q5VST9 | A | {"author_residue_number":97,"author_insertion_... | 1 | 1.00 | True | 2dku | {"author_residue_number":8,"author_insertion_c... | A | 3004 | 2915
1 | Q5VST9 | A | {"author_residue_number":102,"author_insertion... | 1 | 0.75 | True | 2dm7 | {"author_residue_number":8,"author_insertion_c... | A | 3631 | 3537
2 | Q5VST9 | A | {"author_residue_number":548,"author_insertion... | 1 | 0.89 | True | 2yz8 | {"author_residue_number":459,"author_insertion... | A | 3273 | 3184
3 | Q5VST9 | A | {"author_residue_number":null,"author_insertio... | 1 | 0.98 | True | 5tzm | {"author_residue_number":3,"author_insertion_c... | A | 4521 | 4431
4 | Q5VST9 | A | {"author_residue_number":null,"author_insertio... | 1 | 0.93 | True | 4rsv | {"author_residue_number":null,"author_insertio... | A | 4429 | 4337
5 | Q5VST9 | O | {"author_residue_number":null,"author_insertio... | 1 | 1.00 | True | 4c4k | {"author_residue_number":9,"author_insertion_c... | A | 103 | 9
6 | Q5VST9 | A | {"author_residue_number":109,"author_insertion... | 1 | 1.00 | True | 2cr6 | {"author_residue_number":8,"author_insertion_c... | A | 3100 | 2999
7 | Q5VST9 | A | {"author_residue_number":68,"author_insertion_... | 1 | 1.00 | True | 1v1c | {"author_residue_number":1,"author_insertion_c... | A | 5668 | 5601
8 | Q5VST9 | A | {"author_residue_number":93,"author_insertion_... | 1 | 0.93 | True | 2mwc | {"author_residue_number":1,"author_insertion_c... | A | 4429 | 4337
9 | Q5VST9 | A | {"author_residue_number":96,"author_insertion_... | 1 | 1.00 | True | 2edr | {"author_residue_number":8,"author_insertion_c... | A | 3449 | 3361
10 | Q5VST9 | A | {"author_residue_number":101,"author_insertion... | 1 | 1.00 | True | 2edq | {"author_residue_number":8,"author_insertion_c... | A | 3806 | 3713
11 | Q5VST9 | A | {"author_residue_number":101,"author_insertion... | 1 | 1.00 | True | 2edw | {"author_residue_number":8,"author_insertion_c... | A | 3630 | 3537
12 | Q5VST9 | A | {"author_residue_number":96,"author_insertion_... | 1 | 1.00 | True | 2edt | {"author_residue_number":8,"author_insertion_c... | A | 3537 | 3449
13 | Q5VST9 | A | {"author_residue_number":101,"author_insertion... | 1 | 0.82 | True | 2gqh | {"author_residue_number":8,"author_insertion_c... | A | 3807 | 3714
14 | Q5VST9 | 0 | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | A | 203 | 110
15 | Q5VST9 | 2 | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | C | 203 | 110
16 | Q5VST9 | 4 | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | E | 203 | 110
17 | Q5VST9 | 6 | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | G | 203 | 110
18 | Q5VST9 | 8 | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | I | 203 | 110
19 | Q5VST9 | A | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | K | 203 | 110
20 | Q5VST9 | C | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | M | 203 | 110
21 | Q5VST9 | E | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | O | 203 | 110
22 | Q5VST9 | G | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | Q | 203 | 110
23 | Q5VST9 | I | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | S | 203 | 110
24 | Q5VST9 | K | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | U | 203 | 110
25 | Q5VST9 | M | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | W | 203 | 110
26 | Q5VST9 | O | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | Y | 203 | 110
27 | Q5VST9 | Q | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | AA | 203 | 110
28 | Q5VST9 | S | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | CA | 203 | 110
29 | Q5VST9 | U | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | EA | 203 | 110
30 | Q5VST9 | W | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | GA | 203 | 110
31 | Q5VST9 | Y | {"author_residue_number":98,"author_insertion_... | 1 | 0.30 | True | 4uow | {"author_residue_number":5,"author_insertion_c... | IA | 203 | 110
32 | Q5VST9 | A | {"author_residue_number":97,"author_insertion_... | 1 | 1.00 | True | 2edf | {"author_residue_number":8,"author_insertion_c... | A | 2915 | 2826
33 | Q5VST9 | A | {"author_residue_number":97,"author_insertion_... | 1 | 1.00 | True | 2eo1 | {"author_residue_number":8,"author_insertion_c... | A | 1712 | 1623
34 | Q5VST9 | A | {"author_residue_number":98,"author_insertion_... | 1 | 0.94 | True | 2eny | {"author_residue_number":8,"author_insertion_c... | A | 2825 | 2735
35 | Q5VST9 | A | {"author_residue_number":107,"author_insertion... | 1 | 1.00 | True | 2edh | {"author_residue_number":8,"author_insertion_c... | A | 3713 | 3614
36 | Q5VST9 | A | {"author_residue_number":104,"author_insertion... | 1 | 0.92 | True | 2edl | {"author_residue_number":8,"author_insertion_c... | A | 3897 | 3801
37 | Q5VST9 | A | {"author_residue_number":93,"author_insertion_... | 1 | 0.95 | True | 6mg9 | {"author_residue_number":2,"author_insertion_c... | A | 4338 | 4247
38 | Q5VST9 | A | {"author_residue_number":95,"author_insertion_... | 1 | 0.99 | True | 2n56 | {"author_residue_number":1,"author_insertion_c... | A | 4524 | 4430
39 | Q5VST9 | A | {"author_residue_number":97,"author_insertion_... | 1 | 1.00 | True | 2e7b | {"author_residue_number":8,"author_insertion_c... | A | 3273 | 3184

Reformat & Detect

For the information provided by SIFTS, pdb-profiling can further correct the mapped ranges and detect InDel segments, repeated segments, reversed segments, and conflict segments:

SIFTS('P21359-2').fetch_from_pdbe_api('api/mappings/all_isoforms/', SIFTS.to_dataframe
    ).then(SIFTS.reformat
    ).then(SIFTS.dealWithInDel
    ).then(SIFTS.fix_range
    ).then(SIFTS.add_residue_conflict).result()
index | UniProt | chain_id | entity_id | identity | is_canonical | pdb_id | struct_asym_id | pdb_range | unp_range | Entry | ... | sifts_range_tag | repeated | reversed | InDel_sum | new_pdb_range | new_unp_range | conflict_pdb_index | conflict_pdb_range | conflict_unp_range | unp_len
0 | P21359-2 | A | 1 | 1.00 | False | 1nf1 | A | [[1,333]] | [[1198,1530]] | P21359 | ... | Safe | False | False | 0 | [[1,333]] | [[1198,1530]] | {"216":"R"} | [[216,216]] | [[1413,1413]] | 2818
1 | P21359-2 | A | 1 | 1.00 | False | 2d4q | A | [[1,257]] | [[1560,1816]] | P21359 | ... | Safe | False | False | 0 | [[1,257]] | [[1560,1816]] | {} | [] | [] | 2818
2 | P21359-2 | B | 1 | 1.00 | False | 2d4q | B | [[1,257]] | [[1560,1816]] | P21359 | ... | Safe | False | False | 0 | [[1,257]] | [[1560,1816]] | {} | [] | [] | 2818
3 | P21359-2 | A | 1 | 0.99 | False | 2e2x | A | [[6,277]] | [[1545,1816]] | P21359 | ... | Safe | False | False | 0 | [[6,277]] | [[1545,1816]] | {} | [] | [] | 2818
4 | P21359-2 | B | 1 | 0.99 | False | 2e2x | B | [[6,277]] | [[1545,1816]] | P21359 | ... | Safe | False | False | 0 | [[6,277]] | [[1545,1816]] | {} | [] | [] | 2818
5 | P21359-2 | A | 1 | 0.98 | False | 3p7z | A | [[5,276]] | [[1545,1816]] | P21359 | ... | Safe | False | False | 0 | [[5,276]] | [[1545,1816]] | {"44":"I"} | [[44,44]] | [[1584,1584]] | 2818
6 | P21359-2 | B | 1 | 0.98 | False | 3p7z | B | [[5,276]] | [[1545,1816]] | P21359 | ... | Safe | False | False | 0 | [[5,276]] | [[1545,1816]] | {"44":"I"} | [[44,44]] | [[1584,1584]] | 2818
7 | P21359-2 | A | 1 | 0.94 | False | 3peg | A | [[5,172],[174,290]] | [[1545,1712],[1700,1816]] | P21359 | ... | InDel_1 | True | False | -12 | [[5,172],[174,290]] | [[1545,1712],[1700,1816]] | {} | [] | [] | 2818
8 | P21359-2 | A | 1 | 1.00 | False | 3pg7 | A | [[1,256]] | [[1560,1816]] | P21359 | ... | Deletion | False | False | 1 | ((1, 191), (192, 256)) | ((1560, 1750), (1752, 1816)) | {"191":"K"} | [[191,191]] | [[1750,1750]] | 2818
9 | P21359-2 | B | 1 | 1.00 | False | 3pg7 | B | [[1,256]] | [[1560,1816]] | P21359 | ... | Deletion | False | False | 1 | ((1, 191), (192, 256)) | ((1560, 1750), (1752, 1816)) | {"191":"K"} | [[191,191]] | [[1750,1750]] | 2818
10 | P21359-2 | B | 2 | 1.00 | False | 6ob2 | B | [[2,256]] | [[1209,1463]] | P21359 | ... | Safe | False | False | 0 | [[2,256]] | [[1209,1463]] | {} | [] | [] | 2818
11 | P21359-2 | D | 2 | 1.00 | False | 6ob2 | D | [[2,256]] | [[1209,1463]] | P21359 | ... | Safe | False | False | 0 | [[2,256]] | [[1209,1463]] | {} | [] | [] | 2818
12 | P21359-2 | B | 2 | 1.00 | False | 6ob3 | B | [[2,256]] | [[1209,1463]] | P21359 | ... | Safe | False | False | 0 | [[2,256]] | [[1209,1463]] | {} | [] | [] | 2818
13 | P21359-2 | D | 2 | 1.00 | False | 6ob3 | D | [[2,256]] | [[1209,1463]] | P21359 | ... | Safe | False | False | 0 | [[2,256]] | [[1209,1463]] | {} | [] | [] | 2818
14 | P21359-2 | B | 2 | 1.00 | False | 6v65 | B | [[2,329]] | [[1203,1530]] | P21359 | ... | Safe | False | False | 0 | [[2,329]] | [[1203,1530]] | {} | [] | [] | 2818
15 | P21359-2 | B | 2 | 1.00 | False | 6v6f | B | [[2,329]] | [[1203,1530]] | P21359 | ... | Safe | False | False | 0 | [[2,329]] | [[1203,1530]] | {} | [] | [] | 2818
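The new_pdb_range and new_unp_range columns pair up segment by segment, which makes it possible to translate a residue index on the PDB chain into the corresponding UniProt index. The sketch below assumes that paired segments have equal length and map position by position; map_pdb_to_unp is an illustrative helper and not part of pdb-profiling.

```python
def map_pdb_to_unp(pdb_index, pdb_ranges, unp_ranges):
    """Map a PDB sequence index to its UniProt index, or None if it is unmapped."""
    for (pdb_beg, pdb_end), (unp_beg, unp_end) in zip(pdb_ranges, unp_ranges):
        if (pdb_end - pdb_beg) != (unp_end - unp_beg):
            raise ValueError("paired segments must have equal length")
        if pdb_beg <= pdb_index <= pdb_end:
            return unp_beg + (pdb_index - pdb_beg)
    return None


# new_pdb_range / new_unp_range of 3pg7 chain A, copied from the table above.
new_pdb_range = ((1, 191), (192, 256))
new_unp_range = ((1560, 1750), (1752, 1816))

print(map_pdb_to_unp(191, new_pdb_range, new_unp_range))  # 1750
print(map_pdb_to_unp(192, new_pdb_range, new_unp_range))  # 1752
print(map_pdb_to_unp(300, new_pdb_range, new_unp_range))  # None
```

The result for index 191 agrees with the conflict_pdb_range and conflict_unp_range columns of the same row, and the jump from 1750 to 1752 reflects the single-residue deletion tagged in sifts_range_tag.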

Batch retrieval from SIFTS

Likewise, pdb-profiling provides the SIFTSs class for retrieving SIFTS API data in batch.

SIFTSs(('P21359', 'Q5VST9', 'P21359-5')).fetch('fetch_from_pdbe_api', api_suffix='api/mappings/all_isoforms/', then_func=SIFTS.to_dataframe).run().result()

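Scoring SIFTS mappings

Although SIFTS provides an identity metric for judging how well a UniProt isoform sequence agrees with the SEQRES sequence of the PDB chain mapped to it, this identity only reflects the locally mapped region. It does not account for factors that matter in protein structure studies, such as the sequence outside the mapped region or reversed mappings. pdb-profiling therefore collects six metrics and combines them into a weighted score, RAW_BS, to help users judge how well a UniProt isoform matches a PDB chain.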

sifts_info, score_detail, experimental_info = SIFTS('P21359-3').pipe_score().result()

The function returns three DataFrames.
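To see why identity alone can be misleading, consider how much of the isoform a chain actually covers. The sketch below computes a simple coverage fraction from the unp_range and unp_len values shown in the Reformat & Detect output. It is only an illustration of the point about local matching. It is not the RAW_BS formula, whose six metrics and weights are not described in this tutorial.

```python
def range_len(ranges):
    """Total number of residues covered by a list of inclusive [begin, end] ranges."""
    return sum(end - beg + 1 for beg, end in ranges)


def unp_coverage(unp_range, unp_len):
    """Fraction of the UniProt isoform sequence covered by the mapped ranges."""
    return range_len(unp_range) / unp_len


# 1nf1 chain A mapped to P21359-2: identity is 1.00, yet only a small part is covered.
coverage = unp_coverage([[1198, 1530]], 2818)
print(f"{coverage:.3f}")  # 0.118
```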

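Retrieving Assembly and Interface information

Let us now return to the PDB class and look more closely at data at the Entry, Assembly/Model, and Chain levels.

For a PDB entry, the associated Assembly information can be obtained as follows: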

pdb_object.profile_id().result()
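index | pdb_id | entity_id | molecule_type | chain_id | struct_asym_id | assembly_id | model_id | asym_id_rank | oper_expression | symmetry_operation | symmetry_id | struct_asym_id_in_assembly | au_subset | details
0 | 3hl2 | 1 | polypeptide(L) | A | A | 0 | 1 | 1 |  |  |  | A | False | asymmetric_unit
1 | 3hl2 | 1 | polypeptide(L) | C | C | 0 | 1 | 1 |  |  |  | C | False | asymmetric_unit
2 | 3hl2 | 1 | polypeptide(L) | B | B | 0 | 1 | 1 |  |  |  | B | False | asymmetric_unit
3 | 3hl2 | 1 | polypeptide(L) | D | D | 0 | 1 | 1 |  |  |  | D | False | asymmetric_unit
4 | 3hl2 | 2 | polyribonucleotide | E | E | 0 | 1 | 1 |  |  |  | E | False | asymmetric_unit
5 | 3hl2 | 3 | bound | B | I | 0 | 1 | 1 |  |  |  | I | False | asymmetric_unit
6 | 3hl2 | 3 | bound | D | N | 0 | 1 | 1 |  |  |  | N | False | asymmetric_unit
7 | 3hl2 | 3 | bound | C | L | 0 | 1 | 1 |  |  |  | L | False | asymmetric_unit
8 | 3hl2 | 3 | bound | A | F | 0 | 1 | 1 |  |  |  | F | False | asymmetric_unit
9 | 3hl2 | 4 | bound | A | H | 0 | 1 | 1 |  |  |  | H | False | asymmetric_unit
10 | 3hl2 | 4 | bound | C | M | 0 | 1 | 1 |  |  |  | M | False | asymmetric_unit
11 | 3hl2 | 4 | bound | A | G | 0 | 1 | 1 |  |  |  | G | False | asymmetric_unit
12 | 3hl2 | 5 | bound | D | P | 0 | 1 | 1 |  |  |  | P | False | asymmetric_unit
13 | 3hl2 | 5 | bound | B | K | 0 | 1 | 1 |  |  |  | K | False | asymmetric_unit
14 | 3hl2 | 5 | bound | B | J | 0 | 1 | 1 |  |  |  | J | False | asymmetric_unit
15 | 3hl2 | 5 | bound | D | O | 0 | 1 | 1 |  |  |  | O | False | asymmetric_unit
16 | 3hl2 | 1 | polypeptide(L) | A | A | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | A | False | author_defined_assembly
17 | 3hl2 | 1 | polypeptide(L) | B | B | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | B | False | author_defined_assembly
18 | 3hl2 | 3 | bound | A | F | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | F | False | author_defined_assembly
19 | 3hl2 | 4 | bound | A | G | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | G | False | author_defined_assembly
20 | 3hl2 | 4 | bound | A | H | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | H | False | author_defined_assembly
21 | 3hl2 | 3 | bound | B | I | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | I | False | author_defined_assembly
22 | 3hl2 | 5 | bound | B | J | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | J | False | author_defined_assembly
23 | 3hl2 | 5 | bound | B | K | 1 | 1 | 1 | ["1"] | ["x,x-y-1,-z"] | ["6_545"] | K | False | author_defined_assembly
24 | 3hl2 | 1 | polypeptide(L) | A | A | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | AA | False | author_defined_assembly
25 | 3hl2 | 1 | polypeptide(L) | B | B | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | BA | False | author_defined_assembly
26 | 3hl2 | 2 | polyribonucleotide | E | E | 1 | 2 | 1 | ["2"] | ["x,y,z"] | ["1_555"] | E | False | author_defined_assembly
27 | 3hl2 | 3 | bound | A | F | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | FA | False | author_defined_assembly
28 | 3hl2 | 4 | bound | A | G | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | GA | False | author_defined_assembly
29 | 3hl2 | 4 | bound | A | H | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | HA | False | author_defined_assembly
30 | 3hl2 | 3 | bound | B | I | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | IA | False | author_defined_assembly
31 | 3hl2 | 5 | bound | B | J | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | JA | False | author_defined_assembly
32 | 3hl2 | 5 | bound | B | K | 1 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | KA | False | author_defined_assembly
33 | 3hl2 | 1 | polypeptide(L) | C | C | 2 | 1 | 1 | ["3"] | ["-x+y,y,-z+1/3"] | ["5_555"] | C | False | author_defined_assembly
34 | 3hl2 | 1 | polypeptide(L) | D | D | 2 | 1 | 1 | ["3"] | ["-x+y,y,-z+1/3"] | ["5_555"] | D | False | author_defined_assembly
35 | 3hl2 | 3 | bound | C | L | 2 | 1 | 1 | ["3"] | ["-x+y,y,-z+1/3"] | ["5_555"] | L | False | author_defined_assembly
36 | 3hl2 | 4 | bound | C | M | 2 | 1 | 1 | ["3"] | ["-x+y,y,-z+1/3"] | ["5_555"] | M | False | author_defined_assembly
37 | 3hl2 | 3 | bound | D | N | 2 | 1 | 1 | ["3"] | ["-x+y,y,-z+1/3"] | ["5_555"] | N | False | author_defined_assembly
38 | 3hl2 | 5 | bound | D | O | 2 | 1 | 1 | ["3"] | ["-x+y,y,-z+1/3"] | ["5_555"] | O | False | author_defined_assembly
39 | 3hl2 | 5 | bound | D | P | 2 | 1 | 1 | ["3"] | ["-x+y,y,-z+1/3"] | ["5_555"] | P | False | author_defined_assembly
40 | 3hl2 | 1 | polypeptide(L) | C | C | 2 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | CA | False | author_defined_assembly
41 | 3hl2 | 1 | polypeptide(L) | D | D | 2 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | DA | False | author_defined_assembly
42 | 3hl2 | 2 | polyribonucleotide | E | E | 2 | 2 | 1 | ["2"] | ["x,y,z"] | ["1_555"] | E | False | author_defined_assembly
43 | 3hl2 | 3 | bound | C | L | 2 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | LA | False | author_defined_assembly
44 | 3hl2 | 4 | bound | C | M | 2 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | MA | False | author_defined_assembly
45 | 3hl2 | 3 | bound | D | N | 2 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | NA | False | author_defined_assembly
46 | 3hl2 | 5 | bound | D | O | 2 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | OA | False | author_defined_assembly
47 | 3hl2 | 5 | bound | D | P | 2 | 2 | 2 | ["2"] | ["x,y,z"] | ["1_555"] | PA | False | author_defined_assembly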

An assembly_id of 0 denotes the asymmetric unit.
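When only biological assemblies are of interest, the rows with assembly_id 0 can be skipped and the remaining chains grouped per assembly. The sketch below does this on the polymer rows of assembly 0 and assembly 1 from the output above, using plain tuples in place of DataFrame rows.

```python
from collections import defaultdict

# (assembly_id, struct_asym_id_in_assembly) for the polymer chains of
# assemblies 0 and 1 in the output above.
rows = [
    (0, "A"), (0, "C"), (0, "B"), (0, "D"), (0, "E"),
    (1, "A"), (1, "B"), (1, "AA"), (1, "BA"), (1, "E"),
]

chains_by_assembly = defaultdict(list)
for assembly_id, asym_in_assembly in rows:
    if assembly_id == 0:  # 0 is the asymmetric unit, not a biological assembly
        continue
    chains_by_assembly[assembly_id].append(asym_in_assembly)

print(dict(chains_by_assembly))  # {1: ['A', 'B', 'AA', 'BA', 'E']}
```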

Interfacing with PISA

Assembly

Following an object-oriented design, pdb-profiling also provides a class for assemblies, PDBAssembly. The biological assemblies of a PDB entry can be obtained as follows:

pdb_object.set_assembly().result()
assemblies = pdb_object.assembly
'''
>>> assemblies  # Note: this is a Python dictionary
>>> {0: <PDBAssembly 3hl2/0>, 1: <PDBAssembly 3hl2/1>, 2: <PDBAssembly 3hl2/2>}
'''

If you already know the ID of a particular Assembly of the PDB entry, the corresponding object can also be obtained as follows:

from pdb_profiling.processors import PDBAssembly

pdb_assembly_object = PDBAssembly('3hl2/1')
pdb_assembly_object.Assembly_summary
'''
{'preferred': True, 'form': 'hetero', 'name': 'pentamer', 'assembly_id': '1'}
'''

Interface

Integrating PDBe PISA API resources

For the Interfaces defined by PISA, pdb-profiling also provides a corresponding class, PDBInterface. The Interfaces of a PDBAssembly can be obtained as follows:

pdb_assembly_object = assemblies[1]
pdb_assembly_object.pipe_protein_protein_interface().result()
interfaces = pdb_assembly_object.interface
interfaces
'''
{1: <PDBInterface 3hl2/1/1>,
 2: <PDBInterface 3hl2/1/2>,
 3: <PDBInterface 3hl2/1/3>,
 4: <PDBInterface 3hl2/1/4>,
 13: <PDBInterface 3hl2/1/13>,
 14: <PDBInterface 3hl2/1/14>}
'''

If you already know the ID of a particular Interface within a PDB Assembly, the corresponding object can also be obtained as follows:

from pdb_profiling.processors import PDBInterface

pdb_interface_object = PDBInterface('3hl2/1/3')
pdb_interface_object.get_interface_res_dict().result()
'''
{'entity_id_1': 1,
 'chain_id_1': 'A',
 'struct_asym_id_1': 'A',
 'struct_asym_id_in_assembly_1': 'AA',
 'asym_id_rank_1': 2,
 'model_id_1': 2,
 'molecule_type_1': 'polypeptide(L)',
 'surface_range_1': '[[23,61],[63,109],[111,120],[123,124],[127,130],[132,139],[141,146],[148,151],[153,164],[168,178],[180,204],[206,208],[210,216],[223,223],[225,227],[230,234],[236,237],[240,241],[243,246],[248,248],[252,252],[254,255],[257,262],[264,265],[267,268],[270,276],[281,284],[286,292],[298,302],[304,305],[307,321],[324,325],[328,329],[331,373],[375,390],[392,424],[426,437],[439,456],[458,463]]',
 'interface_range_1': '[[23,25],[27,28],[31,31],[47,50],[52,53],[56,57],[60,61],[65,68],[82,82],[86,86],[89,90],[109,109]]',
 'entity_id_2': 1,
 'chain_id_2': 'B',
 'struct_asym_id_2': 'B',
 'struct_asym_id_in_assembly_2': 'B',
 'asym_id_rank_2': 1,
 'model_id_2': 1,
 'molecule_type_2': 'polypeptide(L)',
 'surface_range_2': '[[21,61],[63,109],[111,121],[123,125],[127,130],[132,139],[141,146],[149,150],[153,164],[167,178],[180,204],[206,208],[210,216],[223,223],[225,227],[230,234],[236,237],[240,241],[243,246],[248,248],[252,252],[254,255],[257,262],[264,265],[267,268],[270,276],[280,284],[286,292],[294,294],[298,302],[304,305],[307,326],[328,329],[331,424],[426,437],[439,446],[448,453],[455,456],[458,463]]',
 'interface_range_2': '[[21,22],[24,25],[27,28],[31,31],[47,50],[52,53],[56,57],[60,61],[65,68],[82,82],[86,86],[90,90]]',
 'pdb_id': '3hl2',
 'assembly_id': 1,
 'interface_id': 3,
 'use_au': False}
'''
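The range fields in this dictionary are strings that hold JSON lists of inclusive [begin, end] pairs. The sketch below decodes the two interface_range values shown above and counts the residues on each side of the interface; count_residues is an illustrative helper and not part of pdb-profiling.

```python
import json


def count_residues(range_cell: str) -> int:
    """Count residues in a JSON-encoded list of inclusive [begin, end] ranges."""
    return sum(end - beg + 1 for beg, end in json.loads(range_cell))


# Values copied from the dictionary above.
interface_range_1 = (
    "[[23,25],[27,28],[31,31],[47,50],[52,53],[56,57],"
    "[60,61],[65,68],[82,82],[86,86],[89,90],[109,109]]"
)
interface_range_2 = (
    "[[21,22],[24,25],[27,28],[31,31],[47,50],[52,53],"
    "[56,57],[60,61],[65,68],[82,82],[86,86],[90,90]]"
)

print(count_residues(interface_range_1), count_residues(interface_range_2))  # 25 24
```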

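Interfacing with Interactome3D

The first step is to prepare the complete all-species Interactome3D interaction dataset. pdb-profiling currently uses only the interaction records backed by experimental PDB crystal structures: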

from pdb_profiling.processors.i3d.api import Interactome3D

Interactome3D.pipe_init_interaction_meta().result()
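Note: the two lines of code above need to be run only once, even after restarting the Python process, because they download the files and prepare the database.

Once the data is ready, the interaction information can be queried: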

SIFTS.search_partner_from_i3d('P21359', ('ho','he')).result()
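index | pdb_id | Entry_1 | Entry_2 | assembly_id | model_id_1 | chain_id_1 | model_id_2 | chain_id_2 | organism | interaction_type
0 | 6ob3 | P01116 | P21359 | 1 | 1 | A | 1 | B | human | he
1 | 6ob3 | P01116 | P21359 | 2 | 1 | C | 1 | D | human | he
2 | 6ob2 | P01116 | P21359 | 1 | 1 | A | 1 | B | human | he
3 | 6ob2 | P01116 | P21359 | 2 | 1 | C | 1 | D | human | he
4 | 2d4q | P21359 | P21359 | 1 | 1 | B | 1 | A | human | ho
5 | 2e2x | P21359 | P21359 | 1 | 1 | B | 1 | A | human | ho
6 | 3peg | P21359 | P21359 | 2 | 1 | A | 2 | A | human | ho
7 | 3p7z | P21359 | P21359 | 1 | 1 | B | 1 | A | human | ho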
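In this output the interaction_type values 'ho' and 'he' presumably stand for homo and hetero interactions, which matches the rows above: Entry_1 equals Entry_2 in every 'ho' row. The sketch below collects the interaction partners of a query protein per type from a few of these rows; partners_of is an illustrative helper and not part of pdb-profiling.

```python
# (pdb_id, Entry_1, Entry_2, interaction_type) copied from rows of the table above.
rows = [
    ("6ob3", "P01116", "P21359", "he"),
    ("6ob2", "P01116", "P21359", "he"),
    ("2d4q", "P21359", "P21359", "ho"),
    ("3peg", "P21359", "P21359", "ho"),
]


def partners_of(accession, rows, interaction_type):
    """Map each partner of `accession` to the PDB entries supporting the interaction."""
    found = {}
    for pdb_id, entry_1, entry_2, itype in rows:
        if itype != interaction_type or accession not in (entry_1, entry_2):
            continue
        partner = entry_2 if entry_1 == accession else entry_1
        found.setdefault(partner, []).append(pdb_id)
    return found


print(partners_of("P21359", rows, "he"))  # {'P01116': ['6ob3', '6ob2']}
print(partners_of("P21359", rows, "ho"))  # {'P21359': ['2d4q', '3peg']}
```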

Selecting a representative set of monomer structures

sifts_demo = SIFTS('P21359-2')
df1 = sifts_demo.pipe_select_mo().result()
df1
index | UniProt | chain_id | entity_id | identity | is_canonical | pdb_id | struct_asym_id | pdb_range | unp_range | Entry | ... | resolution | experimental_method_class | experimental_method | multi_method | revision_date | deposition_date | 1/resolution | id_score | select_tag | select_rank
0 | P21359-2 | A | 1 | 1.00 | False | 1nf1 | A | [[1,333]] | [[1198,1530]] | P21359 | ... | 2.500 | x-ray | X-ray diffraction | False | 20171004 | 19980708 | 0.400000 | -65 | False | 15
1 | P21359-2 | A | 1 | 1.00 | False | 2d4q | A | [[1,257]] | [[1560,1816]] | P21359 | ... | 2.300 | x-ray | X-ray diffraction | False | 20110713 | 20051022 | 0.434783 | -65 | False | 4
2 | P21359-2 | B | 1 | 1.00 | False | 2d4q | B | [[1,257]] | [[1560,1816]] | P21359 | ... | 2.300 | x-ray | X-ray diffraction | False | 20110713 | 20051022 | 0.434783 | -66 | False | 6
3 | P21359-2 | A | 1 | 0.99 | False | 2e2x | A | [[6,277]] | [[1545,1816]] | P21359 | ... | 2.500 | x-ray | X-ray diffraction | False | 20110713 | 20061118 | 0.400000 | -65 | False | 12
4 | P21359-2 | B | 1 | 0.99 | False | 2e2x | B | [[6,277]] | [[1545,1816]] | P21359 | ... | 2.500 | x-ray | X-ray diffraction | False | 20110713 | 20061118 | 0.400000 | -66 | False | 13
5 | P21359-2 | A | 1 | 0.98 | False | 3p7z | A | [[5,276]] | [[1545,1816]] | P21359 | ... | 2.650 | x-ray | X-ray diffraction | False | 20190717 | 20101013 | 0.377358 | -65 | False | 5
6 | P21359-2 | B | 1 | 0.98 | False | 3p7z | B | [[5,276]] | [[1545,1816]] | P21359 | ... | 2.650 | x-ray | X-ray diffraction | False | 20190717 | 20101013 | 0.377358 | -66 | True | 2
7 | P21359-2 | A | 1 | 1.00 | False | 3pg7 | A | [[1,256]] | [[1560,1816]] | P21359 | ... | 2.189 | x-ray | X-ray diffraction | False | 20110713 | 20101031 | 0.456830 | -65 | False | 7
8 | P21359-2 | B | 1 | 1.00 | False | 3pg7 | B | [[1,256]] | [[1560,1816]] | P21359 | ... | 2.189 | x-ray | X-ray diffraction | False | 20110713 | 20101031 | 0.456830 | -66 | False | 8
9 | P21359-2 | B | 2 | 1.00 | False | 6ob2 | B | [[2,256]] | [[1209,1463]] | P21359 | ... | 2.845 | x-ray | X-ray diffraction | False | 20191113 | 20190319 | 0.351494 | -66 | False | 14
10 | P21359-2 | D | 2 | 1.00 | False | 6ob2 | D | [[2,256]] | [[1209,1463]] | P21359 | ... | 2.845 | x-ray | X-ray diffraction | False | 20191113 | 20190319 | 0.351494 | -68 | False | 9
11 | P21359-2 | B | 2 | 1.00 | False | 6ob3 | B | [[2,256]] | [[1209,1463]] | P21359 | ... | 2.100 | x-ray | X-ray diffraction | False | 20191113 | 20190319 | 0.476190 | -66 | False | 11
12 | P21359-2 | D | 2 | 1.00 | False | 6ob3 | D | [[2,256]] | [[1209,1463]] | P21359 | ... | 2.100 | x-ray | X-ray diffraction | False | 20191113 | 20190319 | 0.476190 | -68 | False | 10
13 | P21359-2 | B | 2 | 1.00 | False | 6v65 | B | [[2,329]] | [[1203,1530]] | P21359 | ... | 2.763 | x-ray | X-ray diffraction | False | 20200805 | 20191204 | 0.361925 | -66 | False | 3
14 | P21359-2 | B | 2 | 1.00 | False | 6v6f | B | [[2,329]] | [[1203,1530]] | P21359 | ... | 2.542 | x-ray | X-ray diffraction | False | 20200805 | 20191205 | 0.393391 | -66 | True | 1

This pipeline automatically performs the reformatting and scoring steps described above.
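In the output above, the chains chosen as representatives are the rows whose select_tag is True, and select_rank gives their ordering. The sketch below extracts them from a few stand-in rows using plain tuples. On the real DataFrame, a boolean filter on select_tag followed by sort_values('select_rank') should give the same result.

```python
# (pdb_id, chain_id, select_tag, select_rank) copied from rows of the table above.
rows = [
    ("1nf1", "A", False, 15),
    ("2d4q", "A", False, 4),
    ("3p7z", "B", True, 2),
    ("6v65", "B", False, 3),
    ("6v6f", "B", True, 1),
]

# Keep the selected rows and order them by rank (1 is the best).
selected = sorted((row for row in rows if row[2]), key=lambda row: row[3])
representatives = [(pdb_id, chain_id) for pdb_id, chain_id, _, _ in selected]
print(representatives)  # [('6v6f', 'B'), ('3p7z', 'B')]
```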

Selecting a representative set of homo-oligomer structures

df2 = sifts_demo.pipe_select_ho(run_as_completed=True).result()
df2
entity_id_1chain_id_1struct_asym_id_1struct_asym_id_in_assembly_1asym_id_rank_1model_id_1molecule_type_1surface_range_1interface_range_1entity_id_2chain_id_2struct_asym_id_2struct_asym_id_in_assembly_2asym_id_rank_2model_id_2molecule_type_2surface_range_2interface_range_2pdb_idassembly_idinterface_iduse_auUniProt_1identity_1is_canonical_1pdb_range_1unp_range_1Entry_1range_diff_1sifts_range_tag_1repeated_1reversed_1InDel_sum_1new_pdb_range_1new_unp_range_1conflict_pdb_index_1conflict_pdb_range_1conflict_unp_range_1unp_len_1BINDING_LIGAND_COUNT_1BINDING_LIGAND_INDEX_1OBS_COUNT_1OBS_INDEX_1OBS_RATIO_SUM_1ARTIFACT_INDEX_1NON_COUNT_1NON_INDEX_1SEQRES_COUNT_1STD_COUNT_1STD_INDEX_1UNK_COUNT_1UNK_INDEX_1ca_p_only_1OBS_STD_INDEX_1OBS_STD_COUNT_1RAW_BS_1RAW_BS_IG3_1resolutionexperimental_method_classexperimental_methodmulti_methodrevision_datedeposition_date1/resolutionid_score_1select_tag_1select_rank_1UniProt_2identity_2is_canonical_2pdb_range_2unp_range_2Entry_2range_diff_2sifts_range_tag_2repeated_2reversed_2InDel_sum_2new_pdb_range_2new_unp_range_2conflict_pdb_index_2conflict_pdb_range_2conflict_unp_range_2unp_len_2BINDING_LIGAND_COUNT_2BINDING_LIGAND_INDEX_2OBS_COUNT_2OBS_INDEX_2OBS_RATIO_SUM_2ARTIFACT_INDEX_2NON_COUNT_2NON_INDEX_2SEQRES_COUNT_2STD_COUNT_2STD_INDEX_2UNK_COUNT_2UNK_INDEX_2ca_p_only_2OBS_STD_INDEX_2OBS_STD_COUNT_2RAW_BS_2RAW_BS_IG3_2id_score_2select_tag_2select_rank_2in_i3dunp_range_DSCbest_select_rank_scoresecond_select_rank_scoreunp_interface_range_1unp_interface_range_2i_groupi_select_tagi_select_rank
01AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q03FalseP21359-21.00False[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359-21.00False[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.089219-66False6False1.00.2500000.166667((1590, 1591), (1593, 1593), (1626, 1626), (16...((1569, 1569), (1590, 1591), (1593, 1593), (16...(P21359-2, P21359-2)False4
11AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q13TrueP21359-21.00False[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359-21.00False[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.089219-66False6True1.00.2500000.166667((1590, 1591), (1593, 1593), (1626, 1626), (16...((1569, 1569), (1590, 1591), (1593, 1593), (16...(P21359-2, P21359-2)True3
21AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x05FalseP21359-20.99False[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-65False12P21359-20.99False[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.074735-66False13False1.00.0833330.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-2)False9
31AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x15TrueP21359-20.99False[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-65False12P21359-20.99False[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.074735-66False13True1.00.0833330.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-2)False8
41AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z05FalseP21359-20.98False[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281823[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0863800.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-65False5P21359-20.98False[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281816[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0888640.094542-66True2False1.00.5000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-2)False2
51AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z15TrueP21359-20.98False[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281823[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0863800.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-65False5P21359-20.98False[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281816[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0888640.094542-66True2True1.00.5000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-2)True1
61AAA11polypeptide(L)[[1,24],[28,235],[237,238],[240,256]][[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,61],[63,63],[65,134],[136,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...3pg704FalseP21359-21.00False[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281815[[13, 13], [28, 28], [61, 61], [73, 73], [75, ...256[[1, 256]]247.929[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0838810.0892042.189x-rayX-ray diffractionFalse20110713201010310.456830-65False7P21359-21.00False[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281817[[61, 61], [73, 74], [76, 76], [82, 83], [91, ...256[[1, 256]]249.123[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0831710.089204-66False8False1.00.1428570.125000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-2)False5
72BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[228,228],[231,232],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[91,91],[95,95],[99,99],[194,195],[197,197],[...6ob3010FalseP21359-21.00False[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P21359-21.00False[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.082567-68False10False1.00.1000000.090909((1435, 1435), (1438, 1439), (1441, 1442), (14...((1298, 1298), (1302, 1302), (1306, 1306), (14...(P21359-2, P21359-2)True7
82BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[125,125],[231,231],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[99,99],[194,194],[197,197],[199,201]]6ob209FalseP21359-21.00False[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P21359-21.00False[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.080586-68False9False1.00.1111110.071429((1332, 1332), (1438, 1438), (1441, 1442), (14...((1306, 1306), (1401, 1401), (1404, 1404), (14...(P21359-2, P21359-2)True6

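This process automatically carries out the steps mentioned above: reformatting, scoring, and integration with the PISA and Interactome3D data resources. The same applies to pipe_select_ho_iso and pipe_select_he below.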
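As a rough illustration of how such a selection result might be consumed, the sketch below uses pandas to filter a small hand-made table down to its flagged rows. The column names `pdb_id`, `i_group`, `i_select_tag` and `i_select_rank` are taken from the dataframes shown in this tutorial. The values, the helper name `representative_interfaces`, and the reading of `i_select_tag` as "chosen as representative" with `i_select_rank` as its ordering are assumptions made for illustration, not documented behaviour of pdb-profiling.

```python
import pandas as pd

# Toy stand-in for a selection result; the values are illustrative only
# and are NOT real pdb-profiling output.
toy = pd.DataFrame({
    "pdb_id": ["2d4q", "2d4q", "3p7z", "2e2x"],
    "i_group": [
        ("P21359-2", "P21359-2"),
        ("P21359-2", "P21359-2"),
        ("P21359-2", "P21359-2"),
        ("P21359-2", "P21359-2"),
    ],
    "i_select_tag": [True, False, True, False],
    "i_select_rank": [2, 6, 1, -1],
})


def representative_interfaces(df: pd.DataFrame) -> pd.DataFrame:
    """Keep rows whose i_select_tag is True, ordered by i_select_rank.

    Hypothetical helper: it assumes i_select_tag marks the selected rows
    and that a smaller i_select_rank means a better-ranked row.
    """
    selected = df[df["i_select_tag"]]
    return selected.sort_values("i_select_rank").reset_index(drop=True)


result = representative_interfaces(toy)
print(result[["pdb_id", "i_select_rank"]])
```

On a real result, the same helper could be applied to the dataframe returned by a `pipe_select_*` call, provided those columns are present.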

Selecting a representative structure set for homo-oligomers (isoform)

df3 = sifts_demo.pipe_select_ho_iso(run_as_completed=True).result()
df3
entity_id_1chain_id_1struct_asym_id_1struct_asym_id_in_assembly_1asym_id_rank_1model_id_1molecule_type_1surface_range_1interface_range_1entity_id_2chain_id_2struct_asym_id_2struct_asym_id_in_assembly_2asym_id_rank_2model_id_2molecule_type_2surface_range_2interface_range_2pdb_idassembly_idinterface_iduse_auUniProt_1identifier_1identity_1is_canonical_1name_1pdb_range_1unp_range_1Entry_1range_diff_1sifts_range_tag_1repeated_1reversed_1InDel_sum_1new_pdb_range_1new_unp_range_1conflict_pdb_index_1conflict_pdb_range_1conflict_unp_range_1unp_len_1BINDING_LIGAND_COUNT_1BINDING_LIGAND_INDEX_1OBS_COUNT_1OBS_INDEX_1OBS_RATIO_SUM_1ARTIFACT_INDEX_1NON_COUNT_1NON_INDEX_1SEQRES_COUNT_1STD_COUNT_1STD_INDEX_1UNK_COUNT_1UNK_INDEX_1ca_p_only_1OBS_STD_INDEX_1OBS_STD_COUNT_1RAW_BS_1RAW_BS_IG3_1resolutionexperimental_method_classexperimental_methodmulti_methodrevision_datedeposition_date1/resolutionid_score_1select_tag_1select_rank_1UniProt_2identifier_2identity_2is_canonical_2name_2pdb_range_2unp_range_2Entry_2range_diff_2sifts_range_tag_2repeated_2reversed_2InDel_sum_2new_pdb_range_2new_unp_range_2conflict_pdb_index_2conflict_pdb_range_2conflict_unp_range_2unp_len_2BINDING_LIGAND_COUNT_2BINDING_LIGAND_INDEX_2OBS_COUNT_2OBS_INDEX_2OBS_RATIO_SUM_2ARTIFACT_INDEX_2NON_COUNT_2NON_INDEX_2SEQRES_COUNT_2STD_COUNT_2STD_INDEX_2UNK_COUNT_2UNK_INDEX_2ca_p_only_2OBS_STD_INDEX_2OBS_STD_COUNT_2RAW_BS_2RAW_BS_IG3_2id_score_2select_tag_2select_rank_2in_i3dbest_select_rank_scoresecond_select_rank_scoreunp_interface_range_1unp_interface_range_2i_groupi_select_tagi_select_rank
01BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...1AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...2d4q03FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.0892192.300x-rayX-ray diffractionFalse20110713200510220.434783-66False6P21359NF1_HUMAN1.00TrueNF1_HUMAN[[1,257]][[1581,1837]]P21359[0]SafeFalseFalse0[[1,257]][[1581,1837]]{}[][]283912[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0862980.090525-65False2False0.5000000.166667((1569, 1569), (1590, 1591), (1593, 1593), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False6
11AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q03FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359NF1_HUMAN1.00TrueNF1_HUMAN[[1,257]][[1581,1837]]P21359[0]SafeFalseFalse0[[1,257]][[1581,1837]]{}[][]283911[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0846840.088559-66False4False0.2500000.250000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1590), (1611, 1612), (1614, 1614), (16...(P21359-2, P21359)False2
21AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q03FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359-4NF1_HUMAN0.91FalseNF1_HUMAN[[1,11]][[1581,1591]]P21359[0]SafeFalseFalse0[[1,11]][[1581,1591]]{"11":"T"}[[11,11]][[1591,1591]]159811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))255-0.150814-0.143930-66False-1False-1.0000000.250000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1590),)(P21359-2, P21359-4)False-1
31AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q03FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]283611[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0847740.088653-66False6False0.2500000.166667((1590, 1591), (1593, 1593), (1626, 1626), (16...((1569, 1569), (1590, 1591), (1593, 1593), (16...(P21359-2, P21359-6)False2
41BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...1AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...2d4q03FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.0892192.300x-rayX-ray diffractionFalse20110713200510220.434783-66False6P21359-4NF1_HUMAN0.91FalseNF1_HUMAN[[1,11]][[1581,1591]]P21359[0]SafeFalseFalse0[[1,11]][[1581,1591]]{"11":"T"}[[11,11]][[1591,1591]]159812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)257-0.151439-0.143930-65False-1False-1.0000000.166667((1569, 1569), (1590, 1591), (1593, 1593), (16...()(P21359-2, P21359-4)False-1
51BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...1AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...2d4q03FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.0892192.300x-rayX-ray diffractionFalse20110713200510220.434783-66False6P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]283612[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0863890.090621-65False4False0.2500000.166667((1569, 1569), (1590, 1591), (1593, 1593), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False4
61BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...1AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...2d4q13TrueP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.0892192.300x-rayX-ray diffractionFalse20110713200510220.434783-66False6P21359NF1_HUMAN1.00TrueNF1_HUMAN[[1,257]][[1581,1837]]P21359[0]SafeFalseFalse0[[1,257]][[1581,1837]]{}[][]283912[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0862980.090525-65False2True0.5000000.166667((1569, 1569), (1590, 1591), (1593, 1593), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False5
71AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q13TrueP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359NF1_HUMAN1.00TrueNF1_HUMAN[[1,257]][[1581,1837]]P21359[0]SafeFalseFalse0[[1,257]][[1581,1837]]{}[][]283911[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0846840.088559-66False4True0.2500000.250000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1590), (1611, 1612), (1614, 1614), (16...(P21359-2, P21359)True1
81AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q13TrueP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359-4NF1_HUMAN0.91FalseNF1_HUMAN[[1,11]][[1581,1591]]P21359[0]SafeFalseFalse0[[1,11]][[1581,1591]]{"11":"T"}[[11,11]][[1591,1591]]159811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))255-0.150814-0.143930-66False-1True-1.0000000.250000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1590),)(P21359-2, P21359-4)False-1
91AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...2d4q13TrueP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0869410.0911992.300x-rayX-ray diffractionFalse20110713200510220.434783-65False4P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]283611[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0847740.088653-66False6True0.2500000.166667((1590, 1591), (1593, 1593), (1626, 1626), (16...((1569, 1569), (1590, 1591), (1593, 1593), (16...(P21359-2, P21359-6)True1
101BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...1AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...2d4q13TrueP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.0892192.300x-rayX-ray diffractionFalse20110713200510220.434783-66False6P21359-4NF1_HUMAN0.91FalseNF1_HUMAN[[1,11]][[1581,1591]]P21359[0]SafeFalseFalse0[[1,11]][[1581,1591]]{"11":"T"}[[11,11]][[1591,1591]]159812[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)257-0.151439-0.143930-65False-1True-1.0000000.166667((1569, 1569), (1590, 1591), (1593, 1593), (16...()(P21359-2, P21359-4)False-1
111BBB11polypeptide(L)[[1,24],[28,28],[30,63],[65,65],[67,120],[123,...[[10,10],[31,32],[34,34],[67,67],[72,72],[148,...1AAA11polypeptide(L)[[1,11],[13,25],[27,28],[30,42],[44,236],[238,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...2d4q13TrueP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]281811[[28, 28], [63, 63], [73, 76], [107, 107], [10...255[[1, 120], [123, 257]]254.009[]0[]257257[[1, 257]]0[]False((1, 120), (123, 257))2550.0853150.0892192.300x-rayX-ray diffractionFalse20110713200510220.434783-66False6P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[1,257]][[1560,1816]]P21359[0]SafeFalseFalse0[[1,257]][[1560,1816]]{}[][]283612[[61, 61], [73, 76], [82, 82], [86, 86], [107,...257[[1, 257]]257.000[]0[]257257[[1, 257]]0[]False((1, 257),)2570.0863890.090621-65False4True0.2500000.166667((1569, 1569), (1590, 1591), (1593, 1593), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False3
121BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x05FalseP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-66False13P21359NF1_HUMAN0.99TrueNF1_HUMAN[[6,277]][[1566,1837]]P21359[0]SafeFalseFalse0[[6,277]][[1566,1837]]{}[][]28390[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0741820.074182-65False7False0.1428570.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False15
131AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x05FalseP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-65False12P21359NF1_HUMAN0.99TrueNF1_HUMAN[[6,277]][[1566,1837]]P21359[0]SafeFalseFalse0[[6,277]][[1566,1837]]{}[][]28390[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0741820.074182-66False8False0.1250000.083333((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False13
141AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x05FalseP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-65False12P21359-6NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28360[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0742610.074261-66False13False0.0833330.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False14
151BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x05FalseP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-66False13P21359-6NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28360[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0742610.074261-65False12False0.0833330.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False16
161BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x15TrueP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-66False13P21359NF1_HUMAN0.99TrueNF1_HUMAN[[6,277]][[1566,1837]]P21359[0]SafeFalseFalse0[[6,277]][[1566,1837]]{}[][]28390[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0741820.074182-65False7True0.1428570.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False14
171AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x15TrueP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-65False12P21359NF1_HUMAN0.99TrueNF1_HUMAN[[6,277]][[1566,1837]]P21359[0]SafeFalseFalse0[[6,277]][[1566,1837]]{}[][]28390[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0741820.074182-66False8True0.1250000.083333((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False12
181AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x15TrueP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-65False12P21359-6NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28360[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0742610.074261-66False13True0.0833330.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False13
191BBB11polypeptide(L)[[28,44],[48,48],[50,62],[64,81],[83,83],[85,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...1AAA11polypeptide(L)[[28,44],[47,48],[50,52],[54,62],[64,81],[83,8...[[51,52],[54,54],[87,87],[92,92],[168,169],[17...2e2x15TrueP21359-2NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28180[]250[[28, 277]]249.180[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0747350.0747352.500x-rayX-ray diffractionFalse20110713200611180.400000-66False13P21359-6NF1_HUMAN0.99FalseNF1_HUMAN[[6,277]][[1545,1816]]P21359[0]SafeFalseFalse0[[6,277]][[1545,1816]]{}[][]28360[]250[[28, 277]]248.894[[1, 5]]0[]277277[[1, 277]]0[]False((28, 277),)2500.0742610.074261-65False12True0.0833330.076923((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False15
201BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z05FalseP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281816[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0888640.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-66True2P21359NF1_HUMAN0.98TrueNF1_HUMAN[[5,276]][[1566,1837]]P21359[0]SafeFalseFalse0[[5,276]][[1566,1837]]{"44":"I"}[[44,44]][[1605,1605]]283923[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0857410.093842-65False3False0.5000000.333333((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False8
211AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z05FalseP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281823[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0863800.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-65False5P21359NF1_HUMAN0.98TrueNF1_HUMAN[[5,276]][[1566,1837]]P21359[0]SafeFalseFalse0[[5,276]][[1566,1837]]{"44":"I"}[[44,44]][[1605,1605]]283916[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0882070.093842-66True1False1.0000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False4
221AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z05FalseP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281823[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0863800.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-65False5P21359-6NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]283616[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0883000.093942-66True2False0.5000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False6
231BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z05FalseP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281816[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0888640.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-66True2P21359-6NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]283623[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0858320.093942-65False5False0.5000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False8
241BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z15TrueP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281816[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0888640.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-66True2P21359NF1_HUMAN0.98TrueNF1_HUMAN[[5,276]][[1566,1837]]P21359[0]SafeFalseFalse0[[5,276]][[1566,1837]]{"44":"I"}[[44,44]][[1605,1605]]283923[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0857410.093842-65False3True0.5000000.333333((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False7
251AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z15TrueP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281823[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0863800.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-65False5P21359NF1_HUMAN0.98TrueNF1_HUMAN[[5,276]][[1566,1837]]P21359[0]SafeFalseFalse0[[5,276]][[1566,1837]]{"44":"I"}[[44,44]][[1605,1605]]283916[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0882070.093842-66True1True1.0000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False3
261AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z15TrueP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281823[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0863800.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-65False5P21359-6NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]283616[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0883000.093942-66True2True0.5000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False5
271BBB11polypeptide(L)[[7,43],[46,61],[63,80],[82,84],[86,157],[159,...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...1AAA11polypeptide(L)[[7,44],[46,47],[49,62],[64,82],[84,84],[86,25...[[50,51],[53,53],[86,86],[91,91],[167,168],[17...3p7z15TrueP21359-2NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]281816[[80, 80], [92, 95], [101, 102], [110, 110], [...270[[7, 276]]269.180[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0888640.0945422.650x-rayX-ray diffractionFalse20190717201010130.377358-66True2P21359-6NF1_HUMAN0.98FalseNF1_HUMAN[[5,276]][[1545,1816]]P21359[0]SafeFalseFalse0[[5,276]][[1545,1816]]{"44":"I"}[[44,44]][[1584,1584]]283623[[40, 42], [47, 47], [66, 66], [70, 70], [78, ...270[[7, 276]]267.845[[1, 4]]0[]276276[[1, 276]]0[]False((7, 276),)2700.0858320.093942-65False5True0.5000000.200000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False7
281BBB11polypeptide(L)[[1,24],[28,28],[30,61],[63,63],[65,134],[136,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1AAA11polypeptide(L)[[1,24],[28,235],[237,238],[240,256]][[31,32],[34,34],[67,67],[72,72],[148,149],[15...3pg704FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281817[[61, 61], [73, 74], [76, 76], [82, 83], [91, ...256[[1, 256]]249.123[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0831710.0892042.189x-rayX-ray diffractionFalse20110713201010310.456830-66False8P21359NF1_HUMAN1.00TrueNF1_HUMAN[[1,256]][[1581,1837]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1581, 1771), (1773, 1837)){"191":"K"}[[191,191]][[1771,1771]]283915[[13, 13], [28, 28], [61, 61], [73, 73], [75, ...256[[1, 256]]247.929[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0832610.088544-65False5False0.2000000.125000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False10
291AAA11polypeptide(L)[[1,24],[28,235],[237,238],[240,256]][[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,61],[63,63],[65,134],[136,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...3pg704FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281815[[13, 13], [28, 28], [61, 61], [73, 73], [75, ...256[[1, 256]]247.929[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0838810.0892042.189x-rayX-ray diffractionFalse20110713201010310.456830-65False7P21359NF1_HUMAN1.00TrueNF1_HUMAN[[1,256]][[1581,1837]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1581, 1771), (1773, 1837)){"191":"K"}[[191,191]][[1771,1771]]283917[[61, 61], [73, 74], [76, 76], [82, 83], [91, ...256[[1, 256]]249.123[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0825560.088544-66False6False0.1666670.142857((1590, 1591), (1593, 1593), (1626, 1626), (16...((1611, 1612), (1614, 1614), (1647, 1647), (16...(P21359-2, P21359)False9
301AAA11polypeptide(L)[[1,24],[28,235],[237,238],[240,256]][[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,61],[63,63],[65,134],[136,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...3pg704FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281815[[13, 13], [28, 28], [61, 61], [73, 73], [75, ...256[[1, 256]]247.929[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0838810.0892042.189x-rayX-ray diffractionFalse20110713201010310.456830-65False7P21359-4NF1_HUMAN0.91FalseNF1_HUMAN[[1,11]][[1581,1591]]P21359[0]SafeFalseFalse0[[1,11]][[1581,1591]]{"11":"T"}[[11,11]][[1591,1591]]159817[[61, 61], [73, 74], [76, 76], [82, 83], [91, ...256[[1, 256]]249.123[]0[]256256[[1, 256]]0[]False((1, 256),)256-0.153942-0.143304-66False-1False-1.0000000.142857((1590, 1591), (1593, 1593), (1626, 1626), (16...()(P21359-2, P21359-4)False-1
311AAA11polypeptide(L)[[1,24],[28,235],[237,238],[240,256]][[31,32],[34,34],[67,67],[72,72],[148,149],[15...1BBB11polypeptide(L)[[1,24],[28,28],[30,61],[63,63],[65,134],[136,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...3pg704FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281815[[13, 13], [28, 28], [61, 61], [73, 73], [75, ...256[[1, 256]]247.929[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0838810.0892042.189x-rayX-ray diffractionFalse20110713201010310.456830-65False7P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]283617[[61, 61], [73, 74], [76, 76], [82, 83], [91, ...256[[1, 256]]249.123[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0826430.088638-66False8False0.1428570.125000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False9
321BBB11polypeptide(L)[[1,24],[28,28],[30,61],[63,63],[65,134],[136,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1AAA11polypeptide(L)[[1,24],[28,235],[237,238],[240,256]][[31,32],[34,34],[67,67],[72,72],[148,149],[15...3pg704FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281817[[61, 61], [73, 74], [76, 76], [82, 83], [91, ...256[[1, 256]]249.123[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0831710.0892042.189x-rayX-ray diffractionFalse20110713201010310.456830-66False8P21359-4NF1_HUMAN0.91FalseNF1_HUMAN[[1,11]][[1581,1591]]P21359[0]SafeFalseFalse0[[1,11]][[1581,1591]]{"11":"T"}[[11,11]][[1591,1591]]159815[[13, 13], [28, 28], [61, 61], [73, 73], [75, ...256[[1, 256]]247.929[]0[]256256[[1, 256]]0[]False((1, 256),)256-0.152691-0.143304-65False-1False-1.0000000.125000((1590, 1591), (1593, 1593), (1626, 1626), (16...()(P21359-2, P21359-4)False-1
331BBB11polypeptide(L)[[1,24],[28,28],[30,61],[63,63],[65,134],[136,...[[31,32],[34,34],[67,67],[72,72],[148,149],[15...1AAA11polypeptide(L)[[1,24],[28,235],[237,238],[240,256]][[31,32],[34,34],[67,67],[72,72],[148,149],[15...3pg704FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]281817[[61, 61], [73, 74], [76, 76], [82, 83], [91, ...256[[1, 256]]249.123[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0831710.0892042.189x-rayX-ray diffractionFalse20110713201010310.456830-66False8P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[1,256]][[1560,1816]]P21359[1]DeletionFalseFalse1((1, 191), (192, 256))((1560, 1750), (1752, 1816)){"191":"K"}[[191,191]][[1750,1750]]283615[[13, 13], [28, 28], [61, 61], [73, 73], [75, ...256[[1, 256]]247.929[]0[]256256[[1, 256]]0[]False((1, 256),)2560.0833490.088638-65False7False0.1428570.125000((1590, 1591), (1593, 1593), (1626, 1626), (16...((1590, 1591), (1593, 1593), (1626, 1626), (16...(P21359-2, P21359-6)False10
342DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[99,99],[194,194],[197,197],[199,201]]2BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[125,125],[231,231],[234,235],[237,238],[241,...6ob209FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P21359NF1_HUMAN0.92TrueNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]28398[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0390430.041861-66False14False0.1111110.071429((1306, 1306), (1401, 1401), (1404, 1404), (14...((1332, 1332), (1459, 1459), (1462, 1463), (14...(P21359-2, P21359)False18
352BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[125,125],[231,231],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[99,99],[194,194],[197,197],[199,201]]6ob209FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P21359NF1_HUMAN0.92TrueNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]28390[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0457930.045793-68False11False0.0909090.071429((1332, 1332), (1438, 1438), (1441, 1442), (14...((1306, 1306), (1422, 1422), (1425, 1425), (14...(P21359-2, P21359)False16
362BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[125,125],[231,231],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[99,99],[194,194],[197,197],[199,201]]6ob209FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P21359-4NF1_HUMAN0.92FalseNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]15980[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0813550.081355-68False3False0.3333330.071429((1332, 1332), (1438, 1438), (1441, 1442), (14...((1306, 1306), (1422, 1422), (1425, 1425), (14...(P21359-2, P21359-4)False3
372BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[125,125],[231,231],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[99,99],[194,194],[197,197],[199,201]]6ob209FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28360[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0800750.080075-68False9False0.1111110.071429((1332, 1332), (1438, 1438), (1441, 1442), (14...((1306, 1306), (1401, 1401), (1404, 1404), (14...(P21359-2, P21359-6)False17
382DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[99,99],[194,194],[197,197],[199,201]]2BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[125,125],[231,231],[234,235],[237,238],[241,...6ob209FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P21359-4NF1_HUMAN0.92FalseNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]15988[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0693640.074370-66False6False0.1666670.111111((1306, 1306), (1401, 1401), (1404, 1404), (14...((1332, 1332), (1459, 1459), (1462, 1463), (14...(P21359-2, P21359-4)False4
392DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[99,99],[194,194],[197,197],[199,201]]2BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[125,125],[231,231],[234,235],[237,238],[241,...6ob209FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28368[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0733180.076139-66False14False0.1111110.071429((1306, 1306), (1401, 1401), (1404, 1404), (14...((1332, 1332), (1438, 1438), (1441, 1442), (14...(P21359-2, P21359-6)False18
402DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[91,91],[95,95],[99,99],[194,195],[197,197],[...2BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[228,228],[231,232],[234,235],[237,238],[241,...6ob3010FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P21359NF1_HUMAN0.92TrueNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]283910[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0422710.045793-66False13False0.1000000.076923((1298, 1298), (1302, 1302), (1306, 1306), (14...((1456, 1456), (1459, 1460), (1462, 1463), (14...(P21359-2, P21359)True17
412BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[228,228],[231,232],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[91,91],[95,95],[99,99],[194,195],[197,197],[...6ob3010FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P21359NF1_HUMAN0.92TrueNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]28397[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0452930.047759-68False12False0.0909090.083333((1435, 1435), (1438, 1439), (1441, 1442), (14...((1298, 1298), (1302, 1302), (1306, 1306), (14...(P21359-2, P21359)True11
422BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[228,228],[231,232],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[91,91],[95,95],[99,99],[194,195],[197,197],[...6ob3010FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P21359-4NF1_HUMAN0.92FalseNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]15987[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0804680.084848-68False4False0.2500000.090909((1435, 1435), (1438, 1439), (1441, 1442), (14...((1298, 1298), (1302, 1302), (1306, 1306), (14...(P21359-2, P21359-4)True1
432BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[228,228],[231,232],[234,235],[237,238],[241,...2DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[91,91],[95,95],[99,99],[194,195],[197,197],[...6ob3010FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28367[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0795750.082043-68False10False0.1000000.090909((1435, 1435), (1438, 1439), (1441, 1442), (14...((1298, 1298), (1302, 1302), (1306, 1306), (14...(P21359-2, P21359-6)True11
442DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[91,91],[95,95],[99,99],[194,195],[197,197],[...2BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[228,228],[231,232],[234,235],[237,238],[241,...6ob3010FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P21359-4NF1_HUMAN0.92FalseNF1_HUMAN[[2,256]][[1209,1484]]P21359[21]DeletionFalseFalse21((2, 164), (165, 256))((1209, 1371), (1393, 1484)){"164":"A"}[[164,164]][[1371,1371]]159810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0750980.081355-66False5False0.2000000.100000((1298, 1298), (1302, 1302), (1306, 1306), (14...((1456, 1456), (1459, 1460), (1462, 1463), (14...(P21359-2, P21359-4)True2
452DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[91,91],[95,95],[99,99],[194,195],[197,197],[...2BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[228,228],[231,232],[234,235],[237,238],[241,...6ob3010FalseP21359-2NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P21359-6NF1_HUMAN1.00FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]283610[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0765490.080075-66False11False0.1000000.090909((1298, 1298), (1302, 1302), (1306, 1306), (14...((1435, 1435), (1438, 1439), (1441, 1442), (14...(P21359-2, P21359-6)True12

Selecting a Representative Set of Heteromer Structures

df4 = sifts_demo.pipe_select_he(run_as_completed=True).result()
df4
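The result carries three trailing columns, `i_group`, `i_select_tag` and `i_select_rank`, that appear to summarise the selection per UniProt pair. The sketch below shows one way to keep only the flagged interfaces. It runs on a small stand-in table (`demo`, a hypothetical name) whose values were read by hand off a few rows of the output, so treat them as illustrative. It also assumes, from the column names alone, that `i_select_tag` marks the chosen interfaces and that a lower `i_select_rank` is better.

```python
import pandas as pd

# Stand-in for df4: a hand-copied subset of rows and columns, for illustration only.
demo = pd.DataFrame({
    "pdb_id": ["6v6f", "6v6f", "6v65", "6v65", "6ob3"],
    "assembly_id": [0, 0, 0, 1, 0],
    "interface_id": [1, 2, 2, 8, 1],
    "i_group": [
        ("P21359-2", "P01116"),
        ("P21359-2", "Q7Z699"),
        ("P21359-2", "Q7Z699"),
        ("P21359-2", "Q7Z699"),
        ("P21359-2", "P01116"),
    ],
    "i_select_tag": [False, False, True, True, False],
    "i_select_rank": [11, 3, 1, 2, 3],
})

# Assumption: i_select_tag is a boolean flag for selected interfaces.
# Keep the flagged rows and order them by rank (assumed: lower is better).
representative = (
    demo[demo["i_select_tag"]]
    .sort_values("i_select_rank")
    .reset_index(drop=True)
)

print(representative[["pdb_id", "assembly_id", "interface_id", "i_group", "i_select_rank"]])
```

The same boolean mask should work on `df4` itself, provided the column is of boolean dtype there as well.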
Output of `df4`:
entity_id_1chain_id_1struct_asym_id_1struct_asym_id_in_assembly_1asym_id_rank_1model_id_1molecule_type_1surface_range_1interface_range_1entity_id_2chain_id_2struct_asym_id_2struct_asym_id_in_assembly_2asym_id_rank_2model_id_2molecule_type_2surface_range_2interface_range_2pdb_idassembly_idinterface_iduse_auUniProt_1identifier_1identity_1is_canonical_1name_1pdb_range_1unp_range_1Entry_1range_diff_1sifts_range_tag_1repeated_1reversed_1InDel_sum_1new_pdb_range_1new_unp_range_1conflict_pdb_index_1conflict_pdb_range_1conflict_unp_range_1unp_len_1BINDING_LIGAND_COUNT_1BINDING_LIGAND_INDEX_1OBS_COUNT_1OBS_INDEX_1OBS_RATIO_SUM_1ARTIFACT_INDEX_1NON_COUNT_1NON_INDEX_1SEQRES_COUNT_1STD_COUNT_1STD_INDEX_1UNK_COUNT_1UNK_INDEX_1ca_p_only_1OBS_STD_INDEX_1OBS_STD_COUNT_1RAW_BS_1RAW_BS_IG3_1resolutionexperimental_method_classexperimental_methodmulti_methodrevision_datedeposition_date1/resolutionid_score_1select_tag_1select_rank_1UniProt_2identifier_2identity_2is_canonical_2name_2pdb_range_2unp_range_2Entry_2range_diff_2sifts_range_tag_2repeated_2reversed_2InDel_sum_2new_pdb_range_2new_unp_range_2conflict_pdb_index_2conflict_pdb_range_2conflict_unp_range_2unp_len_2BINDING_LIGAND_COUNT_2BINDING_LIGAND_INDEX_2OBS_COUNT_2OBS_INDEX_2OBS_RATIO_SUM_2ARTIFACT_INDEX_2NON_COUNT_2NON_INDEX_2SEQRES_COUNT_2STD_COUNT_2STD_INDEX_2UNK_COUNT_2UNK_INDEX_2ca_p_only_2OBS_STD_INDEX_2OBS_STD_COUNT_2RAW_BS_2RAW_BS_IG3_2id_score_2select_tag_2select_rank_2in_i3dbest_select_rank_scoresecond_select_rank_scoreunp_interface_range_1unp_interface_range_2i_groupi_select_tagi_select_rank
02BBB11polypeptide(L)[[4,22],[24,54],[56,59],[61,82],[84,86],[88,91...[[32,33],[36,36],[71,72],[75,77],[82,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,53],[55,55],[57,72],[74,77]...[[12,13],[18,18],[22,22],[26,26],[30,30],[32,4...6v6f01FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28181[[218, 218]]301[[4, 263], [276, 300], [312, 327]]294.205[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (276, 300), (312, 327))3010.0893010.0896562.542x-rayX-ray diffractionFalse20200805201912050.393391-66True1P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"62":"Q","152":"R","154":"E","166":"Q","167":...[[62,62],[152,152],[154,154],[166,169]][[61,61],[151,151],[153,153],[165,168]]18924[[14, 19], [29, 33], [35, 36], [58, 58], [61, ...168[[2, 169]]166.665[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7524300.879414-67False4False1.0000000.250000((1233, 1234), (1237, 1237), (1272, 1273), (12...((11, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116)False11
12BBB11polypeptide(L)[[4,22],[24,54],[56,59],[61,82],[84,86],[88,91...[[32,33],[36,36],[71,72],[75,77],[82,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,53],[55,55],[57,72],[74,77]...[[12,13],[18,18],[22,22],[26,26],[30,30],[32,4...6v6f01FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28181[[218, 218]]301[[4, 263], [276, 300], [312, 327]]294.205[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (276, 300), (312, 327))3010.0893010.0896562.542x-rayX-ray diffractionFalse20200805201912050.393391-66True1P01116-2RASK_HUMAN0.99FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"62":"Q"}[[62,62]][[61,61]]18824[[14, 19], [29, 33], [35, 36], [58, 58], [61, ...168[[2, 169]]166.665[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7564320.884092-67False4False1.0000000.250000((1233, 1234), (1237, 1237), (1272, 1273), (12...((11, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116-2)False11
22BBB11polypeptide(L)[[4,22],[24,54],[56,59],[61,82],[84,86],[88,91...[[32,33],[36,36],[71,72],[75,77],[82,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,53],[55,55],[57,72],[74,77]...[[12,13],[18,18],[22,22],[26,26],[30,30],[32,4...6v6f13FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28181[[218, 218]]301[[4, 263], [276, 300], [312, 327]]294.205[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (276, 300), (312, 327))3010.0893010.0896562.542x-rayX-ray diffractionFalse20200805201912050.393391-66True1P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"62":"Q","152":"R","154":"E","166":"Q","167":...[[62,62],[152,152],[154,154],[166,169]][[61,61],[151,151],[153,153],[165,168]]18924[[14, 19], [29, 33], [35, 36], [58, 58], [61, ...168[[2, 169]]166.665[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7524300.879414-67False4False1.0000000.250000((1233, 1234), (1237, 1237), (1272, 1273), (12...((11, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116)False12
32BBB11polypeptide(L)[[4,22],[24,54],[56,59],[61,82],[84,86],[88,91...[[32,33],[36,36],[71,72],[75,77],[82,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,53],[55,55],[57,72],[74,77]...[[12,13],[18,18],[22,22],[26,26],[30,30],[32,4...6v6f13FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28181[[218, 218]]301[[4, 263], [276, 300], [312, 327]]294.205[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (276, 300), (312, 327))3010.0893010.0896562.542x-rayX-ray diffractionFalse20200805201912050.393391-66True1P01116-2RASK_HUMAN0.99FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"62":"Q"}[[62,62]][[61,61]]18824[[14, 19], [29, 33], [35, 36], [58, 58], [61, ...168[[2, 169]]166.665[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7564320.884092-67False4False1.0000000.250000((1233, 1234), (1237, 1237), (1272, 1273), (12...((11, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116-2)False12
42BBB11polypeptide(L)[[4,22],[24,54],[56,59],[61,82],[84,86],[88,91...[[10,10],[12,19],[21,21],[49,51],[53,54],[57,5...1AAA11polypeptide(L)[[1,22],[25,47],[49,87],[89,99],[101,112]][[8,8],[10,10],[12,21],[25,25],[27,27],[73,77]...6v6f02FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28181[[218, 218]]301[[4, 263], [276, 300], [312, 327]]294.205[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (276, 300), (312, 327))3010.0893010.0896562.542x-rayX-ray diffractionFalse20200805201912050.393391-66True1Q7Z699SPRE1_HUMAN1.00TrueSPRE1_HUMAN[[1,113]][[13,125]]Q7Z699[0]SafeFalseFalse0[[1,113]][[13,125]]{}[][]4443[[37, 37], [43, 43], [98, 98]]110[[1, 22], [25, 112]]108.533[]0[]113113[[1, 113]]0[]False((1, 22), (25, 112))1100.2288910.235648-65False2False1.0000000.500000((1211, 1211), (1213, 1220), (1222, 1222), (12...((20, 20), (22, 22), (24, 33), (37, 37), (39, ...(P21359-2, Q7Z699)False3
52BBB11polypeptide(L)[[4,22],[24,54],[56,59],[61,82],[84,86],[88,91...[[168,168],[177,178],[229,231],[247,247],[250,...1AAA12polypeptide(L)[[1,22],[25,31],[33,47],[49,87],[89,99],[101,1...[[35,39],[43,45],[63,63],[65,66],[84,84],[111,...6v6f18FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28181[[218, 218]]301[[4, 263], [276, 300], [312, 327]]294.205[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (276, 300), (312, 327))3010.0893010.0896562.542x-rayX-ray diffractionFalse20200805201912050.393391-66True1Q7Z699SPRE1_HUMAN1.00TrueSPRE1_HUMAN[[1,113]][[13,125]]Q7Z699[0]SafeFalseFalse0[[1,113]][[13,125]]{}[][]4443[[37, 37], [43, 43], [98, 98]]110[[1, 22], [25, 112]]108.533[]0[]113113[[1, 113]]0[]False((1, 22), (25, 112))1100.2288910.235648-65False2False1.0000000.500000((1369, 1369), (1378, 1379), (1430, 1432), (14...((47, 51), (55, 57), (75, 75), (77, 78), (96, ...(P21359-2, Q7Z699)False4
62BBB11polypeptide(L)[[4,59],[61,83],[85,86],[88,91],[93,137],[139,...[[32,33],[35,35],[71,72],[75,77],[81,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,55],[57,72],[74,77],[80,81]...[[13,13],[18,18],[22,22],[26,26],[30,42],[55,5...6v6501FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28182[[240, 240], [243, 243]]300[[4, 263], [277, 300], [312, 327]]293.261[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (277, 300), (312, 327))3000.0879560.0886662.763x-rayX-ray diffractionFalse20200805201912040.361925-66False3P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"152":"R","154":"E","166":"Q","167":"Y","168"...[[152,152],[154,154],[166,169]][[151,151],[153,153],[165,168]]18920[[14, 14], [16, 19], [29, 32], [35, 36], [58, ...168[[2, 169]]166.776[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7735940.879414-67False2False0.5000000.333333((1233, 1234), (1236, 1236), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116)False13
72BBB11polypeptide(L)[[4,59],[61,83],[85,86],[88,91],[93,137],[139,...[[32,33],[35,35],[71,72],[75,77],[81,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,55],[57,72],[74,77],[80,81]...[[13,13],[18,18],[22,22],[26,26],[30,42],[55,5...6v6501FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28182[[240, 240], [243, 243]]300[[4, 263], [277, 300], [312, 327]]293.261[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (277, 300), (312, 327))3000.0879560.0886662.763x-rayX-ray diffractionFalse20200805201912040.361925-66False3P01116-2RASK_HUMAN1.00FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{}[][]18820[[14, 14], [16, 19], [29, 32], [35, 36], [58, ...168[[2, 169]]166.776[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7777090.884092-67False2False0.5000000.333333((1233, 1234), (1236, 1236), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116-2)False13
82BBB11polypeptide(L)[[4,59],[61,83],[85,86],[88,91],[93,137],[139,...[[32,33],[35,35],[71,72],[75,77],[81,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,55],[57,72],[74,77],[80,81]...[[13,13],[18,18],[22,22],[26,26],[30,42],[55,5...6v6513FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28182[[240, 240], [243, 243]]300[[4, 263], [277, 300], [312, 327]]293.261[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (277, 300), (312, 327))3000.0879560.0886662.763x-rayX-ray diffractionFalse20200805201912040.361925-66False3P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"152":"R","154":"E","166":"Q","167":"Y","168"...[[152,152],[154,154],[166,169]][[151,151],[153,153],[165,168]]18920[[14, 14], [16, 19], [29, 32], [35, 36], [58, ...168[[2, 169]]166.776[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7735940.879414-67False2False0.5000000.333333((1233, 1234), (1236, 1236), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116)False14
92BBB11polypeptide(L)[[4,59],[61,83],[85,86],[88,91],[93,137],[139,...[[32,33],[35,35],[71,72],[75,77],[81,82],[85,8...3CCC11polypeptide(L)[[2,9],[11,51],[53,55],[57,72],[74,77],[80,81]...[[13,13],[18,18],[22,22],[26,26],[30,42],[55,5...6v6513FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28182[[240, 240], [243, 243]]300[[4, 263], [277, 300], [312, 327]]293.261[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (277, 300), (312, 327))3000.0879560.0886662.763x-rayX-ray diffractionFalse20200805201912040.361925-66False3P01116-2RASK_HUMAN1.00FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{}[][]18820[[14, 14], [16, 19], [29, 32], [35, 36], [58, ...168[[2, 169]]166.776[[1, 1]]0[]170170[[1, 170]]0[]False((2, 169),)1680.7777090.884092-67False2False0.5000000.333333((1233, 1234), (1236, 1236), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116-2)False14
102BBB11polypeptide(L)[[4,59],[61,83],[85,86],[88,91],[93,137],[139,...[[9,10],[12,18],[49,51],[53,54],[57,58],[162,1...1AAA11polypeptide(L)[[1,22],[25,31],[33,87],[89,99],[101,106],[108...[[8,8],[10,10],[12,12],[14,21],[25,25],[27,27]...6v6502FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28182[[240, 240], [243, 243]]300[[4, 263], [277, 300], [312, 327]]293.261[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (277, 300), (312, 327))3000.0879560.0886662.763x-rayX-ray diffractionFalse20200805201912040.361925-66False3Q7Z699SPRE1_HUMAN1.00TrueSPRE1_HUMAN[[1,113]][[13,125]]Q7Z699[0]SafeFalseFalse0[[1,113]][[13,125]]{}[][]4442[[37, 37], [43, 43]]110[[1, 22], [25, 112]]108.199[]0[]113113[[1, 113]]0[]False((1, 22), (25, 112))1100.2311440.235648-65True1False1.0000000.333333((1210, 1211), (1213, 1219), (1250, 1252), (12...((20, 20), (22, 22), (24, 24), (26, 33), (37, ...(P21359-2, Q7Z699)True1
112BBB11polypeptide(L)[[4,59],[61,83],[85,86],[88,91],[93,137],[139,...[[177,178],[229,231],[250,251],[254,255],[258,...1AAA12polypeptide(L)[[1,22],[25,31],[33,87],[89,99],[101,106],[108...[[35,38],[40,40],[43,45],[63,63],[66,66],[111,...6v6518FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,329]][[1203,1530]]P21359[0]SafeFalseFalse0[[2,329]][[1203,1530]]{}[][]28182[[240, 240], [243, 243]]300[[4, 263], [277, 300], [312, 327]]293.261[[1, 1]]0[]329329[[1, 329]]0[]False((4, 263), (277, 300), (312, 327))3000.0879560.0886662.763x-rayX-ray diffractionFalse20200805201912040.361925-66False3Q7Z699SPRE1_HUMAN1.00TrueSPRE1_HUMAN[[1,113]][[13,125]]Q7Z699[0]SafeFalseFalse0[[1,113]][[13,125]]{}[][]4442[[37, 37], [43, 43]]110[[1, 22], [25, 112]]108.199[]0[]113113[[1, 113]]0[]False((1, 22), (25, 112))1100.2311440.235648-65True1False1.0000000.333333((1378, 1379), (1430, 1432), (1451, 1452), (14...((47, 50), (52, 52), (55, 57), (75, 75), (78, ...(P21359-2, Q7Z699)True2
122BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[25,27],[30,30],[64,71],[76,76],[79,79],[115,...1AAA11polypeptide(L)[[1,8],[11,20],[22,53],[55,55],[57,72],[74,77]...[[13,14],[18,18],[22,22],[25,26],[31,42],[55,5...6ob301FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G","152":"R","154":"E","166":"Q","167":...[[14,14],[152,152],[154,154],[166,169]][[13,13],[151,151],[153,153],[165,168]]18922[[13, 19], [23, 23], [29, 31], [35, 36], [61, ...170[[1, 170]]169.566[[1, 1]]0[]170170[[1, 170]]0[]False((1, 170),)1700.7777780.894180-65True1False1.0000000.090909((1232, 1234), (1237, 1237), (1271, 1278), (12...((12, 13), (17, 17), (21, 21), (24, 25), (30, ...(P21359-2, P01116)False3
132BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[25,27],[30,30],[64,71],[76,76],[79,79],[115,...1AAA11polypeptide(L)[[1,8],[11,20],[22,53],[55,55],[57,72],[74,77]...[[13,14],[18,18],[22,22],[25,26],[31,42],[55,5...6ob301FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P01116-2RASK_HUMAN0.99FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G"}[[14,14]][[13,13]]18822[[13, 19], [23, 23], [29, 31], [35, 36], [61, ...170[[1, 170]]169.566[[1, 1]]0[]170170[[1, 170]]0[]False((1, 170),)1700.7819150.898936-65True1False1.0000000.090909((1232, 1234), (1237, 1237), (1271, 1278), (12...((12, 13), (17, 17), (21, 21), (24, 25), (30, ...(P21359-2, P01116-2)False3
142BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[25,27],[30,30],[64,71],[76,76],[79,79],[115,...1AAA11polypeptide(L)[[1,8],[11,20],[22,53],[55,55],[57,72],[74,77]...[[13,14],[18,18],[22,22],[25,26],[31,42],[55,5...6ob311TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G","152":"R","154":"E","166":"Q","167":...[[14,14],[152,152],[154,154],[166,169]][[13,13],[151,151],[153,153],[165,168]]18922[[13, 19], [23, 23], [29, 31], [35, 36], [61, ...170[[1, 170]]169.566[[1, 1]]0[]170170[[1, 170]]0[]False((1, 170),)1700.7777780.894180-65True1True1.0000000.090909((1232, 1234), (1237, 1237), (1271, 1278), (12...((12, 13), (17, 17), (21, 21), (24, 25), (30, ...(P21359-2, P01116)True2
152BBB11polypeptide(L)[[12,48],[50,53],[55,73],[75,76],[78,80],[82,8...[[25,27],[30,30],[64,71],[76,76],[79,79],[115,...1AAA11polypeptide(L)[[1,8],[11,20],[22,53],[55,55],[57,72],[74,77]...[[13,14],[18,18],[22,22],[25,26],[31,42],[55,5...6ob311TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]281810[[41, 41], [51, 51], [69, 69], [129, 130], [13...245[[12, 256]]241.495[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0770380.0805862.100x-rayX-ray diffractionFalse20191113201903190.476190-66False11P01116-2RASK_HUMAN0.99FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G"}[[14,14]][[13,13]]18822[[13, 19], [23, 23], [29, 31], [35, 36], [61, ...170[[1, 170]]169.566[[1, 1]]0[]170170[[1, 170]]0[]False((1, 170),)1700.7819150.898936-65True1True1.0000000.090909((1232, 1234), (1237, 1237), (1271, 1278), (12...((12, 13), (17, 17), (21, 21), (24, 25), (30, ...(P21359-2, P01116-2)True2
162DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[28,28],[31,31],[34,35],[38,38],[42,42],[80,8...1AAA11polypeptide(L)[[1,8],[11,20],[22,53],[55,55],[57,72],[74,77]...[[24,24],[26,28],[43,46],[51,51],[149,150],[15...6ob309FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G","152":"R","154":"E","166":"Q","167":...[[14,14],[152,152],[154,154],[166,169]][[13,13],[151,151],[153,153],[165,168]]18922[[13, 19], [23, 23], [29, 31], [35, 36], [61, ...170[[1, 170]]169.566[[1, 1]]0[]170170[[1, 170]]0[]False((1, 170),)1700.7777780.894180-65True1False1.0000000.100000((1235, 1235), (1238, 1238), (1241, 1242), (12...((23, 23), (25, 27), (42, 45), (50, 50), (148,...(P21359-2, P01116)True1
172DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[28,28],[31,31],[34,35],[38,38],[42,42],[80,8...1AAA11polypeptide(L)[[1,8],[11,20],[22,53],[55,55],[57,72],[74,77]...[[24,24],[26,28],[43,46],[51,51],[149,150],[15...6ob309FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P01116-2RASK_HUMAN0.99FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G"}[[14,14]][[13,13]]18822[[13, 19], [23, 23], [29, 31], [35, 36], [61, ...170[[1, 170]]169.566[[1, 1]]0[]170170[[1, 170]]0[]False((1, 170),)1700.7819150.898936-65True1False1.0000000.100000((1235, 1235), (1238, 1238), (1241, 1242), (12...((23, 23), (25, 27), (42, 45), (50, 50), (148,...(P21359-2, P01116-2)True1
182DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[26,27],[29,30],[65,71],[75,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,23],[25,51],[53,53],[55,55],[57,72...[[12,14],[18,18],[22,22],[26,26],[28,28],[30,3...6ob302FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G","152":"R","154":"E","166":"Q","167":...[[14,14],[152,152],[154,154],[166,169]][[13,13],[151,151],[153,153],[165,168]]18921[[13, 19], [29, 31], [35, 36], [61, 61], [69, ...166[[2, 167]]164.220[[1, 1]]0[]170170[[1, 170]]0[]False((2, 167),)1660.7387720.849883-67False5False0.2000000.100000((1233, 1234), (1236, 1237), (1272, 1278), (12...((11, 13), (17, 17), (21, 21), (25, 25), (27, ...(P21359-2, P01116)False8
192DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[26,27],[29,30],[65,71],[75,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,23],[25,51],[53,53],[55,55],[57,72...[[12,14],[18,18],[22,22],[26,26],[28,28],[30,3...6ob302FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P01116-2RASK_HUMAN0.99FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G"}[[14,14]][[13,13]]18821[[13, 19], [29, 31], [35, 36], [61, 61], [69, ...166[[2, 167]]164.220[[1, 1]]0[]170170[[1, 170]]0[]False((2, 167),)1660.7427010.854403-67False5False0.2000000.100000((1233, 1234), (1236, 1237), (1272, 1278), (12...((11, 13), (17, 17), (21, 21), (25, 25), (27, ...(P21359-2, P01116-2)False8
202DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[26,27],[29,30],[65,71],[75,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,23],[25,51],[53,53],[55,55],[57,72...[[12,14],[18,18],[22,22],[26,26],[28,28],[30,3...6ob322TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G","152":"R","154":"E","166":"Q","167":...[[14,14],[152,152],[154,154],[166,169]][[13,13],[151,151],[153,153],[165,168]]18921[[13, 19], [29, 31], [35, 36], [61, 61], [69, ...166[[2, 167]]164.220[[1, 1]]0[]170170[[1, 170]]0[]False((2, 167),)1660.7387720.849883-67False5True0.2000000.100000((1233, 1234), (1236, 1237), (1272, 1278), (12...((11, 13), (17, 17), (21, 21), (25, 25), (27, ...(P21359-2, P01116)False7
212DDD11polypeptide(L)[[10,48],[50,53],[55,76],[79,80],[82,85],[87,9...[[26,27],[29,30],[65,71],[75,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,23],[25,51],[53,53],[55,55],[57,72...[[12,14],[18,18],[22,22],[26,26],[28,28],[30,3...6ob322TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28187[[41, 41], [44, 44], [126, 126], [133, 133], [...247[[10, 256]]244.327[[1, 1]]0[]256256[[1, 256]]0[]False((10, 256),)2470.0800830.0825672.100x-rayX-ray diffractionFalse20191113201903190.476190-68False10P01116-2RASK_HUMAN0.99FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"14":"G"}[[14,14]][[13,13]]18821[[13, 19], [29, 31], [35, 36], [61, 61], [69, ...166[[2, 167]]164.220[[1, 1]]0[]170170[[1, 170]]0[]False((2, 167),)1660.7427010.854403-67False5True0.2000000.100000((1233, 1234), (1236, 1237), (1272, 1278), (12...((11, 13), (17, 17), (21, 21), (25, 25), (27, ...(P21359-2, P01116-2)False7
222BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[26,27],[29,30],[65,66],[69,71],[76,76],[79,7...1AAA11polypeptide(L)[[2,8],[10,51],[53,55],[57,77],[80,112],[114,1...[[13,13],[18,18],[22,22],[26,26],[30,42],[58,5...6ob201FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"152":"R","154":"E","166":"Q","167":"Y","168"...[[152,152],[154,154],[166,169]][[151,151],[153,153],[165,168]]18925[[13, 19], [29, 32], [35, 36], [61, 61], [68, ...169[[2, 170]]166.890[[1, 1]]0[]170170[[1, 170]]0[]False((2, 170),)1690.7619050.894180-65False3False0.3333330.071429((1233, 1234), (1236, 1237), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116)False6
232BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[26,27],[29,30],[65,66],[69,71],[76,76],[79,7...1AAA11polypeptide(L)[[2,8],[10,51],[53,55],[57,77],[80,112],[114,1...[[13,13],[18,18],[22,22],[26,26],[30,42],[58,5...6ob201FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P01116-2RASK_HUMAN1.00FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{}[][]18825[[13, 19], [29, 32], [35, 36], [61, 61], [68, ...169[[2, 170]]166.890[[1, 1]]0[]170170[[1, 170]]0[]False((2, 170),)1690.7659570.898936-65False3False0.3333330.071429((1233, 1234), (1236, 1237), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116-2)False6
242BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[26,27],[29,30],[65,66],[69,71],[76,76],[79,7...1AAA11polypeptide(L)[[2,8],[10,51],[53,55],[57,77],[80,112],[114,1...[[13,13],[18,18],[22,22],[26,26],[30,42],[58,5...6ob211TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"152":"R","154":"E","166":"Q","167":"Y","168"...[[152,152],[154,154],[166,169]][[151,151],[153,153],[165,168]]18925[[13, 19], [29, 32], [35, 36], [61, 61], [68, ...169[[2, 170]]166.890[[1, 1]]0[]170170[[1, 170]]0[]False((2, 170),)1690.7619050.894180-65False3True0.3333330.071429((1233, 1234), (1236, 1237), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116)False5
252BBB11polypeptide(L)[[12,48],[50,53],[55,76],[78,80],[82,85],[87,1...[[26,27],[29,30],[65,66],[69,71],[76,76],[79,7...1AAA11polypeptide(L)[[2,8],[10,51],[53,55],[57,77],[80,112],[114,1...[[13,13],[18,18],[22,22],[26,26],[30,42],[58,5...6ob211TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28188[[18, 19], [22, 22], [191, 192], [202, 202], [...241[[12, 103], [107, 255]]236.933[[1, 1]]0[]256256[[1, 256]]0[]False((12, 103), (107, 255))2410.0737860.0766252.845x-rayX-ray diffractionFalse20191113201903190.351494-66False14P01116-2RASK_HUMAN1.00FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{}[][]18825[[13, 19], [29, 32], [35, 36], [61, 61], [68, ...169[[2, 170]]166.890[[1, 1]]0[]170170[[1, 170]]0[]False((2, 170),)1690.7659570.898936-65False3True0.3333330.071429((1233, 1234), (1236, 1237), (1272, 1273), (12...((12, 12), (17, 17), (21, 21), (25, 25), (29, ...(P21359-2, P01116-2)False5
262DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[28,28],[31,31],[34,35],[38,38],[42,42],[80,8...1AAA11polypeptide(L)[[2,8],[10,51],[53,55],[57,77],[80,112],[114,1...[[24,24],[26,28],[43,46],[49,49],[51,51],[149,...6ob207FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"152":"R","154":"E","166":"Q","167":"Y","168"...[[152,152],[154,154],[166,169]][[151,151],[153,153],[165,168]]18925[[13, 19], [29, 32], [35, 36], [61, 61], [68, ...169[[2, 170]]166.890[[1, 1]]0[]170170[[1, 170]]0[]False((2, 170),)1690.7619050.894180-65False3False0.3333330.111111((1235, 1235), (1238, 1238), (1241, 1242), (12...((23, 23), (25, 27), (42, 45), (48, 48), (50, ...(P21359-2, P01116)False4
272DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[28,28],[31,31],[34,35],[38,38],[42,42],[80,8...1AAA11polypeptide(L)[[2,8],[10,51],[53,55],[57,77],[80,112],[114,1...[[24,24],[26,28],[43,46],[49,49],[51,51],[149,...6ob207FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P01116-2RASK_HUMAN1.00FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{}[][]18825[[13, 19], [29, 32], [35, 36], [61, 61], [68, ...169[[2, 170]]166.890[[1, 1]]0[]170170[[1, 170]]0[]False((2, 170),)1690.7659570.898936-65False3False0.3333330.111111((1235, 1235), (1238, 1238), (1241, 1242), (12...((23, 23), (25, 27), (42, 45), (48, 48), (50, ...(P21359-2, P01116-2)False4
282DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[26,26],[65,66],[69,71],[76,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,166]][[12,13],[18,18],[22,22],[30,42],[55,55],[57,5...6ob202FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"152":"R","154":"E","166":"Q","167":"Y","168"...[[152,152],[154,154],[166,169]][[151,151],[153,153],[165,168]]18919[[13, 19], [29, 31], [35, 36], [61, 61], [117,...165[[2, 166]]143.555[[1, 1]]0[]170170[[1, 170]]0[]False((2, 166),)1650.7345880.835117-67False6False0.1666670.111111((1233, 1233), (1272, 1273), (1276, 1278), (12...((11, 12), (17, 17), (21, 21), (29, 41), (54, ...(P21359-2, P01116)False10
292DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[26,26],[65,66],[69,71],[76,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,166]][[12,13],[18,18],[22,22],[30,42],[55,55],[57,5...6ob202FalseP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P01116-2RASK_HUMAN1.00FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{}[][]18819[[13, 19], [29, 31], [35, 36], [61, 61], [117,...165[[2, 166]]143.555[[1, 1]]0[]170170[[1, 170]]0[]False((2, 166),)1650.7384950.839559-67False6False0.1666670.111111((1233, 1233), (1272, 1273), (1276, 1278), (12...((11, 12), (17, 17), (21, 21), (29, 41), (54, ...(P21359-2, P01116-2)False10
302DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[26,26],[65,66],[69,71],[76,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,166]][[12,13],[18,18],[22,22],[30,42],[55,55],[57,5...6ob222TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P01116RASK_HUMAN0.96TrueRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{"152":"R","154":"E","166":"Q","167":"Y","168"...[[152,152],[154,154],[166,169]][[151,151],[153,153],[165,168]]18919[[13, 19], [29, 31], [35, 36], [61, 61], [117,...165[[2, 166]]143.555[[1, 1]]0[]170170[[1, 170]]0[]False((2, 166),)1650.7345880.835117-67False6True0.1666670.111111((1233, 1233), (1272, 1273), (1276, 1278), (12...((11, 12), (17, 17), (21, 21), (29, 41), (54, ...(P21359-2, P01116)False9
312DDD11polypeptide(L)[[12,53],[55,76],[78,80],[82,85],[87,100],[102...[[26,26],[65,66],[69,71],[76,76],[79,79],[83,8...1CCC11polypeptide(L)[[2,20],[22,166]][[12,13],[18,18],[22,22],[30,42],[55,55],[57,5...6ob222TrueP21359-2NF1_HUMAN1.0FalseNF1_HUMAN[[2,256]][[1209,1463]]P21359[0]SafeFalseFalse0[[2,256]][[1209,1463]]{}[][]28180[]245[[12, 256]]239.938[[1, 1]]0[]256256[[1, 256]]0[]False((12, 256),)2450.0805860.0805862.845x-rayX-ray diffractionFalse20191113201903190.351494-68False9P01116-2RASK_HUMAN1.00FalseRASK_HUMAN[[2,170]][[1,169]]P01116[0]SafeFalseFalse0[[2,170]][[1,169]]{}[][]18819[[13, 19], [29, 31], [35, 36], [61, 61], [117,...165[[2, 166]]143.555[[1, 1]]0[]170170[[1, 170]]0[]False((2, 166),)1650.7384950.839559-67False6True0.1666670.111111((1233, 1233), (1272, 1273), (1276, 1278), (12...((11, 12), (17, 17), (21, 21), (29, 41), (54, ...(P21359-2, P01116-2)False9

Adding SWISS-MODEL Resources

df5 = sifts_demo.pipe_select_smr_mo(sifts_mo_df=df1).result()
df5
Click to view dataframe
UniProtchainscoordinatescoveragecreated_datefound_bygmqeidentifieridentityisoidligand_chainsmd5methodoligo-stateproviderunp_lensimilaritytemplateunp_rangeacc_agreement_norm_scoreacc_agreement_z_scoreavg_local_scoreavg_local_score_errorcbeta_norm_scorecbeta_z_scoreinteraction_norm_scoreinteraction_z_scorepacking_norm_scorepacking_z_scoreqmean4_norm_scoreqmean4_z_scoreqmean6_norm_scoreqmean6_z_scoress_agreement_norm_scoress_agreement_z_scoretorsion_norm_scoretorsion_z_scoreselect_rankselect_tag
0P21359-2[{"id":"A","segments":[{"smtl":{"aligned_seque...https://swissmodel.expasy.org/repository/unipr...0.0958132020-11-01T18:53:29.485000+00:00BLAST0.0NF1_HUMAN0.9963242[{"count":1,"ligands":[{"description":"(1S)-2-...fc83f587346226e60cc701448008f89fHOMOLOGY MODELLINGmonomerSWISSMODEL28180.6154423p7z.1.A[[1547,1816]]NaNNaN0.7164530.051-0.011142-0.509795-0.020118-0.777705-0.3892430.3020310.767440-0.164094NaNNaN0.597907-0.394816-0.304895-0.1479871False
1P21359-2[{"id":"A","segments":[{"smtl":{"aligned_seque...https://swissmodel.expasy.org/repository/unipr...0.1149752020-11-01T18:53:29.436000+00:00HHblits0.0NF1_HUMAN1.0000002NaNfc83f587346226e60cc701448008f89fHOMOLOGY MODELLINGmonomerSWISSMODEL28180.6121976v6f.1.A[[1205,1528]]NaNNaN0.7252750.052-0.012087-0.252126-0.0230370.008588-0.4007690.5857910.708347-1.885406NaNNaN0.590228-0.466057-0.112651-2.1331392False
2P21359-2[{"id":"A","segments":[{"smtl":{"aligned_seque...https://swissmodel.expasy.org/repository/unipr...0.0489712020-11-01T18:53:29.457000+00:00HHblits0.0NF1_HUMAN0.1594202NaNfc83f587346226e60cc701448008f89fHOMOLOGY MODELLINGmonomerSWISSMODEL28180.2845415owu.1.A[[2221,2358]]NaNNaN0.5087630.0710.007388-3.787026-0.016512-1.403135-0.270749-1.3326740.591839-3.581241NaNNaN0.338587-2.2949930.027482-2.4515823True
3P21359-2[{"id":"A","segments":[{"smtl":{"aligned_seque...https://swissmodel.expasy.org/repository/unipr...0.0468422020-11-01T18:53:29.405000+00:00HHblits0.0NF1_HUMAN0.1068702NaNfc83f587346226e60cc701448008f89fHOMOLOGY MODELLINGmonomerSWISSMODEL28180.2544533woy.1.A[[2268,2399]]NaNNaN0.4236080.0720.011488-4.452434-0.014163-1.706403-0.299886-0.9223070.589486-3.592260NaNNaN0.236340-2.8726750.028206-2.4419664False
4P21359-2[{"id":"L","segments":[{"smtl":{"aligned_seque...https://swissmodel.expasy.org/repository/unipr...0.0493262020-11-01T18:53:29.362000+00:00HHblits0.0NF1_HUMAN0.2148152NaNfc83f587346226e60cc701448008f89fHOMOLOGY MODELLINGmonomerSWISSMODEL28180.2992596ltj.1.L[[1074,1212]]NaNNaN0.4144440.071-0.003353-1.860656-0.016426-1.423716-0.347270-0.3615930.588397-3.625870NaNNaN0.129832-3.5913740.151137-3.2561645True
5P21359-2[{"id":"A","segments":[{"smtl":{"aligned_seque...https://swissmodel.expasy.org/repository/unipr...0.0532292020-11-01T18:53:29.384000+00:00HHblits0.0NF1_HUMAN0.1119402NaNfc83f587346226e60cc701448008f89fHOMOLOGY MODELLINGmonomerSWISSMODEL28180.2601991h2t.1.A[[1986,2135]]NaNNaN0.4097970.0670.008711-4.268426-0.012236-2.127937-0.346139-0.4130690.582163-4.048233NaNNaN0.350606-2.1998050.095252-3.0540756True

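pipe_select_smr_mo takes the SIFTS monomer selection results passed in and combines them with the coverage ranges of the models provided by SMR (SWISS-MODEL Repository) to decide which model structures can be selected as supplements.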
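The idea of supplementing by coverage can be illustrated with a minimal sketch. This is not the algorithm used by pipe_select_smr_mo, whose actual selection rules are not described here; the helper names covered_positions and pick_supplementary_models are hypothetical, and the ranges are copied from the unp_range columns of the tables above.

```python
import json

def covered_positions(ranges):
    """Expand [[start, end], ...] (1-based, inclusive) into a set of positions."""
    positions = set()
    for start, end in ranges:
        positions.update(range(start, end + 1))
    return positions

def pick_supplementary_models(pdb_ranges, model_ranges, min_new=1):
    """Greedy sketch (an assumption, not the library's rule):
    keep a model only if it covers at least min_new positions
    that nothing selected so far covers."""
    covered = covered_positions(pdb_ranges)
    picked = []
    for template, ranges in model_ranges:
        new_positions = covered_positions(ranges) - covered
        if len(new_positions) >= min_new:
            picked.append(template)
            covered |= new_positions
    return picked

# UniProt range covered by the selected PDB chains of P21359-2 (from df1)
pdb_ranges = json.loads("[[1203,1530]]")
# (template, unp_range) pairs taken from three rows of df5
model_ranges = [
    ("6v6f.1.A", [[1205, 1528]]),  # lies entirely inside the PDB coverage
    ("5owu.1.A", [[2221, 2358]]),  # covers a region no PDB chain covers
    ("6ltj.1.L", [[1074, 1212]]),  # extends the coverage towards the N-terminus
]
picked = pick_supplementary_models(pdb_ranges, model_ranges)
print(picked)
```

For these three rows the sketch keeps the same models that carry select_tag True in df5, but the real function may weigh additional criteria such as model quality scores.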

Residue Mapping

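  • unp_residue_number is the 1-based index of a position in the sequence of the UniProt Isoform
  • residue_number is the 1-based index of a position along the PDB chain
  • author_residue_number is the index defined by the authors of the PDB file
  • The authors may define author_insertion_code as a suffix to author_residue_number to distinguish certain residues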
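As a small illustration of how these columns relate, the sketch below joins author_residue_number and author_insertion_code into the labels a structure viewer would typically show. The five rows are copied from the 3sqh chain E example later in this section; representing a missing insertion code as an empty string is an assumption.

```python
# (residue_number, author_residue_number, author_insertion_code)
# for the first five residues of 3sqh chain E
rows = [
    (1, 1, "C"),
    (2, 1, "B"),
    (3, 1, "A"),
    (4, 1, ""),   # assumption: no insertion code is stored as ""
    (5, 2, ""),
]

# residue_number is unique per chain, while author_residue_number
# alone is not: it needs the insertion code to be unambiguous.
labels = {
    residue_number: f"{author_number}{insertion_code}"
    for residue_number, author_number, insertion_code in rows
}
print(labels)
```

Four consecutive residues share author_residue_number 1 here, which is why the insertion code has to travel together with the author numbering.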

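unp_residue_number and author_residue_number may differ. When studying protein sites, many third-party prediction tools require author_residue_number as input, so converting sites consistently to author_residue_number is an important need.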

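In general, there is no way to know in advance whether a PDB chain and a UniProt Isoform share the same residue numbering; this is an index mapping problem. For example, a UniProt Isoform of length 100 has indices 1, 2, ..., 100. A PDB crystal structure matched to this UniProt Isoform may have a matching chain of length 91 that its authors numbered 66, 67, ..., 156, so the two numberings do not line up directly.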
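The example can be made concrete with a few lines of Python. The text does not say which 91 isoform positions the chain matches, so the sketch assumes, purely for illustration, that it matches positions 10 to 100.

```python
# Assumption for illustration only: the chain matches isoform positions 10..100.
unp_positions = list(range(10, 101))
# Author numbering from the example: 66, 67, ..., 156.
author_numbers = list(range(66, 157))

# Both sides must describe the same 91 residues.
assert len(unp_positions) == len(author_numbers) == 91

unp_to_author = dict(zip(unp_positions, author_numbers))
author_to_unp = {author: unp for unp, author in unp_to_author.items()}

print(unp_to_author[10], unp_to_author[100])  # first and last matched positions
```

Under this assumption the two numberings differ by a constant offset of 56. Real chains often break that pattern through gaps, insertions, or insertion codes, which is why an explicit per-residue table is needed.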

The following uses the mapping between P00734 and 3sqh (chain_id: E) as an example:

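The conflict_pdb_index parameter of the two mapping functions below is optional. Pass it only when you are interested in whether residue conflicts exist at the mapped sites (that is, the residue on the PDB chain differs from the residue on the UniProt Isoform).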
# df1 = SIFTS('P00734').pipe_select_mo().result()
# record = df1[df1.pdb_id.eq('3sqh')].iloc[0]  # df1[df1.select_tag.eq(True)].iloc[0]
df_3sqh, _, _ = SIFTS('3sqh').pipe_score().result()
record = df_3sqh.iloc[0]
record
'''
UniProt                                      P00734
chain_id                                          E
entity_id                                         1
identity                                          1
is_canonical                                   True
pdb_id                                         3sqh
struct_asym_id                                    A
pdb_range                                 [[1,290]]
unp_range                               [[333,622]]
Entry                                        P00734
range_diff                                      [0]
sifts_range_tag                                Safe
repeated                                      False
reversed                                      False
InDel_sum                                         0
new_pdb_range                             [[1,290]]
new_unp_range                           [[333,622]]
conflict_pdb_index                      {"236":"S"}
conflict_pdb_range                      [[236,236]]
conflict_unp_range                      [[568,568]]
unp_len                                         622
OBS_INDEX                    ((1, 181), (188, 290))
OBS_COUNT                                       284
OBS_RATIO_SUM                                 283.9
BINDING_LIGAND_INDEX                             ()
BINDING_LIGAND_COUNT                              0
molecule_type                        polypeptide(L)
ca_p_only                                     False
SEQRES_COUNT                                    290
STD_INDEX                               ((1, 290),)
STD_COUNT                                       290
NON_INDEX                                        ()
NON_COUNT                                         0
UNK_INDEX                                        ()
UNK_COUNT                                         0
ARTIFACT_INDEX                                   ()
OBS_STD_INDEX                ((1, 181), (188, 290))
OBS_STD_COUNT                                   284
RAW_BS                                     0.439318
RAW_BS_IG3                                 0.439318
resolution                                      2.2
experimental_method_class                     x-ray
experimental_method               X-ray diffraction
multi_method                                  False
revision_date                              20120523
deposition_date                            20110705
1/resolution                               0.454545
id_score                                        -69
select_tag                                    False
select_rank                                       8
'''
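Some fields of this record can be cross-checked by hand. The sketch below assumes that pdb_range, unp_range and conflict_pdb_index are stored as JSON strings and OBS_INDEX as a tuple of (start, end) pairs, which is how they are displayed above; the values are copied from the record.

```python
import json

pdb_range = json.loads("[[1,290]]")
unp_range = json.loads("[[333,622]]")
conflict_pdb_index = json.loads('{"236":"S"}')
obs_index = ((1, 181), (188, 290))

# With a single segment pair of equal length, the PDB residue_number and
# the UniProt position differ by a constant offset.
offset = unp_range[0][0] - pdb_range[0][0]

# Translate each conflicting PDB residue_number to its UniProt position.
conflicts = {
    int(pdb_index): (int(pdb_index) + offset, code)
    for pdb_index, code in conflict_pdb_index.items()
}

# Number of observed residues: sum of the inclusive segment lengths.
obs_count = sum(end - start + 1 for start, end in obs_index)

print(offset, conflicts, obs_count)
```

The computed UniProt position 568 agrees with conflict_unp_range, and the observed-residue count agrees with OBS_COUNT. Judging from the mapping tables below, where residue 236 is ALA with conflict_code S, the letter stored in conflict_pdb_index appears to be the UniProt residue rather than the PDB one.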

Use the following code to obtain the mapping for every residue in the matched range:

PDB(record['pdb_id']).get_expanded_map_res_df(
    record['UniProt'],
    record['new_unp_range'],
    record['new_pdb_range'],
    conflict_pdb_index=record['conflict_pdb_index'],
    struct_asym_id=record['struct_asym_id']).result()
Click to view dataframe
     unp_residue_number  residue_number  UniProt  author_insertion_code  author_residue_number  chain_id  entity_id  multiple_conformers  observed_ratio  pdb_id  residue_name  struct_asym_id  conflict_code
0                   333               1   P00734                      C                      1         E          1                  NaN             1.0    3sqh           GLU               A            NaN
1                   334               2   P00734                      B                      1         E          1                  NaN             1.0    3sqh           ALA               A            NaN
2                   335               3   P00734                      A                      1         E          1                  NaN             1.0    3sqh           ASP               A            NaN
3                   336               4   P00734                                             1         E          1                  NaN             1.0    3sqh           CYS               A            NaN
4                   337               5   P00734                                             2         E          1                  NaN             1.0    3sqh           GLY               A            NaN
...
285                 618             286   P00734                                           243         E          1                  NaN             1.0    3sqh           ASP               A            NaN
286                 619             287   P00734                                           244         E          1                  NaN             1.0    3sqh           GLN               A            NaN
287                 620             288   P00734                                           245         E          1                  NaN             1.0    3sqh           PHE               A            NaN
288                 621             289   P00734                                           246         E          1                  NaN             1.0    3sqh           GLY               A            NaN
289                 622             290   P00734                                           247         E          1                  NaN             0.9    3sqh           GLU               A            NaN
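The core of such an expansion can be sketched in plain Python: paired range segments are unrolled into one (unp_residue_number, residue_number) pair per residue. This is only an illustration of the idea, not the implementation of get_expanded_map_res_df, and the helper name expand_ranges is hypothetical.

```python
import json

def expand_ranges(unp_range, pdb_range):
    """Unroll paired [[start, end], ...] segments (1-based, inclusive)
    into a list of (unp_residue_number, residue_number) pairs."""
    pairs = []
    for (unp_start, unp_end), (pdb_start, pdb_end) in zip(unp_range, pdb_range):
        if unp_end - unp_start != pdb_end - pdb_start:
            raise ValueError("paired segments must have equal length")
        pairs.extend(zip(range(unp_start, unp_end + 1),
                         range(pdb_start, pdb_end + 1)))
    return pairs

# new_unp_range and new_pdb_range of the 3sqh record
pairs = expand_ranges(json.loads("[[333,622]]"), json.loads("[[1,290]]"))

unp_to_pdb = dict(pairs)
pdb_to_unp = {pdb: unp for unp, pdb in pairs}
print(len(pairs), unp_to_pdb[336], pdb_to_unp[236])
```

The resulting pairs match the first two columns of the table above. The library call additionally attaches the author numbering, insertion codes, observed ratios and conflict codes to each row.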

Alternatively, use get_map_res_df to obtain the mapping only for the sites you specify:

PDB(record['pdb_id']).get_map_res_df(
    record['UniProt'],
    record['new_unp_range'],
    record['new_pdb_range'],
    your_sites=(336, 353, 568, 362),
    conflict_pdb_index=record['conflict_pdb_index'],
    struct_asym_id=record['struct_asym_id']).result()
Click to view dataframe
   author_insertion_code  author_residue_number  chain_id  entity_id  multiple_conformers  observed_ratio  pdb_id  residue_name  residue_number  struct_asym_id  unp_residue_number  UniProt  conflict_code
0                                             1         E          1                  NaN             1.0    3sqh           CYS               4               A                 336   P00734            NaN
1                      D                     14         E          1                  NaN             1.0    3sqh           ARG              21               A                 353   P00734            NaN
2                      M                     14         E          1                  NaN             1.0    3sqh           GLY              30               A                 362   P00734            NaN
3                                           195         E          1                  NaN             1.0    3sqh           ALA             236               A                 568   P00734              S

The values passed as your_sites correspond to the unp_residue_number column of the result.

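get_map_res_df has an unp2pdb parameter that defaults to True, meaning the function treats the values passed as your_sites as sites on the UniProt Isoform and maps them onto the PDB chain. To have the function treat your_sites as sites on the PDB chain and map them onto the UniProt Isoform instead, set unp2pdb=False: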
PDB(record['pdb_id']).get_map_res_df(
    record['UniProt'],
    record['new_unp_range'],
    record['new_pdb_range'],
    your_sites=(22, 31, 5, 237),
    conflict_pdb_index=record['conflict_pdb_index'],
    unp2pdb=False,
    struct_asym_id=record['struct_asym_id']).result()
Click to view dataframe
   author_insertion_code  author_residue_number  chain_id  entity_id  multiple_conformers  observed_ratio  pdb_id  residue_name  residue_number  struct_asym_id  unp_residue_number  UniProt  conflict_code
0                                             2         E          1                  NaN             1.0    3sqh           GLY               5               A                 337   P00734            NaN
1                      E                     14         E          1                  NaN             1.0    3sqh           GLU              22               A                 354   P00734            NaN
2                                            15         E          1                  NaN             1.0    3sqh           ARG              31               A                 363   P00734            NaN
3                                           196         E          1                  NaN             1.0    3sqh           GLY             237               A                 569   P00734            NaN

The values passed as your_sites correspond to the residue_number column of the result.

If unp2pdb=False is set and the sites in your_sites are given as author_residue_number followed by author_insertion_code, also pass author_site=True:
PDB(record['pdb_id']).get_map_res_df(
    record['UniProt'],
    record['new_unp_range'],
    record['new_pdb_range'],
    your_sites=('14E', '2', '14A', '195'),
    conflict_pdb_index=record['conflict_pdb_index'],
    unp2pdb=False,
    author_site=True,
    struct_asym_id=record['struct_asym_id']).result()
Click to view dataframe
   author_insertion_code  author_residue_number  chain_id  entity_id  multiple_conformers  observed_ratio  pdb_id  residue_name  residue_number  struct_asym_id  UniProt  unp_residue_number  conflict_code
0                                             2         E          1                  NaN             1.0    3sqh           GLY               5               A   P00734                 337            NaN
1                      A                     14         E          1                  NaN             1.0    3sqh           LYS              18               A   P00734                 350            NaN
2                      E                     14         E          1                  NaN             1.0    3sqh           GLU              22               A   P00734                 354            NaN
3                                           195         E          1                  NaN             1.0    3sqh           ALA             236               A   P00734                 568              S

As shown, the steps above provide a convenient bidirectional mapping between PDB residues and UniProt Isoform residues.
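When author sites come from another tool as strings such as '14E', it can help to split them into number and insertion code before comparing them with the author_residue_number and author_insertion_code columns. The following is a minimal sketch; the helper split_author_site is hypothetical and is not part of pdb-profiling, whose own parsing of your_sites is not shown here.

```python
import re

# An optional minus sign, digits, then at most one insertion-code letter.
AUTHOR_SITE = re.compile(r"^(-?\d+)([A-Za-z]?)$")

def split_author_site(site):
    """Split an author site such as '14E' into (14, 'E')."""
    match = AUTHOR_SITE.match(site)
    if match is None:
        raise ValueError(f"unrecognised author site: {site!r}")
    number, insertion_code = match.groups()
    return int(number), insertion_code

parsed = [split_author_site(site) for site in ("14E", "2", "14A", "195")]
print(parsed)
```

Sites without an insertion code yield an empty string as the second element, so '14E' and '14A' remain distinct even though they share author_residue_number 14.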


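Having followed this tutorial, you should already be able to accomplish quite a few tasks with pdb-profiling. To learn more about the programming logic, processing logic, and data interpretation behind it, continue with the rest of the documentation, which covers them in more detail.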