Article

Medvik - BMČ

Improving the quality of protein identification in non-model species. Characterization of Quercus ilex seed and Pinus radiata needle proteomes by using SEQUEST and custom databases

MC. Romero-Rodríguez, J. Pascual, L. Valledor, J. Jorrín-Novo

Author: Romero-Rodríguez, M Cristina. Agricultural and Plant Biochemistry and Proteomics Research Group, Dept. of Biochemistry and Molecular Biology, University of Córdoba, Spain
Author: Pascual, Jesús. Plant Physiology, Faculty of Biology, Dept. of Organisms and Systems Biology, University of Oviedo, Spain
Author: Valledor, Luis. Dept. of Biology & Centre for Environmental and Marine Studies, University of Aveiro, Aveiro, Portugal; GCRC, Adaption Biotechnologies, Academy of Sciences of the Czech Republic, Brno, Czech Republic. Electronic address: luis@valledor.info
Author: Jorrín-Novo, Jesús. Agricultural and Plant Biochemistry and Proteomics Research Group, Dept. of Biochemistry and Molecular Biology, University of Córdoba, Spain. Electronic address: bf1jonoj@uco.es

Journal of Proteomics. 2014; 105: 85-91.

J Proteomics
ISSN 1876-7737

Language: English. Country: Netherlands

Document type: dataset, journal article, research support (non-U.S. Gov't)

Persistent link https://www.medvik.cz/link/bmc15014518

PubMed 24508333
DOI 10.1016/j.jprot.2014.01.027

MeSH
Pinus: genetics, metabolism
Databases, Protein *
Quercus: genetics, metabolism
Proteome: genetics, metabolism
Proteomics: methods
Plant Proteins: genetics, metabolism
Amino Acid Sequence
Base Sequence
Sequence Analysis, Protein: methods
Sequence Analysis, RNA: methods
Seeds: genetics, metabolism
Publication type
Journal Article
Dataset
Research Support, Non-U.S. Gov't

UNLABELLED: The most widely used pipeline for protein identification currently relies on comparing MS/MS spectra against reference databases. Search algorithms compare the acquired spectra with an in silico digestion of a sequence database to find exact matches. The database is therefore of paramount importance and largely determines the number and quality of identifications, which is especially relevant for non-model plant species. Using a single Viridiplantae database (NCBI, UniProt) and TAIR is not the best choice for non-model species, because these species are underrepresented in the databases, which results in poor identification rates. We demonstrate that the rate and quality of identifications in two orphan species, Quercus ilex and Pinus radiata, can be improved by using SEQUEST with a combination of public databases (Viridiplantae NCBI, UniProt) and a custom-built, species-specific database containing 593,294 and 455,096 peptide sequences (Quercus and Pinus, respectively). These databases were built by gathering and processing (trimming, contig assembly, 6-frame translation) publicly available RNA sequences, mostly ESTs and NGS reads. A total of 149 and 1533 proteins were identified from Quercus seeds and Pinus needles, respectively, representing a 3.1- and 1.5-fold increase in the number of protein identifications and scores compared with the use of a single database. Since this approach greatly improves the identification rate and is not significantly more complicated or time consuming than other approaches, we recommend its routine use when working with non-model species.

BIOLOGICAL SIGNIFICANCE: In this work we demonstrate that building a custom database (DB) from all available RNA sequences and using it in combination with public Viridiplantae DBs (NCBI, UniProt) significantly improves protein identification when working with non-model species. The protein identification rate and quality are higher than those obtained with routine procedures that use only one database (commonly Viridiplantae from NCBI), as we demonstrated by analyzing Quercus seeds and pine needles. Building a custom database is neither difficult nor time consuming, so we recommend its routine use when working with non-model species. This article is part of a Special Issue entitled: Proteomics of non-model organisms.
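The abstract mentions two computational steps that are easy to illustrate: 6-frame translation of RNA-derived sequences and in silico digestion of the resulting protein sequences. The following is a minimal Python sketch of both, not the authors' pipeline. It assumes the standard genetic code, trypsin-like cleavage (after K or R, not before P) with no missed cleavages, and a minimum peptide length of 6; the abstract specifies none of these, and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate one reading frame; codons not in the table (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translation(sequence: str) -> list[str]:
    """Translate three forward and three reverse-complement frames."""
    dna = sequence.upper().replace("U", "T")  # accept RNA or cDNA input
    frames = []
    for strand in (dna, reverse_complement(dna)):
        for offset in range(3):
            frames.append(translate(strand[offset:]))
    return frames


def tryptic_peptides(protein: str, min_length: int = 6) -> list[str]:
    """Assumed trypsin rule: cleave after K/R unless the next residue is P.

    Stop codons ('*') split the translation into separate open stretches.
    """
    peptides = []
    for stretch in protein.split("*"):
        start = 0
        for i, aa in enumerate(stretch):
            at_end = i == len(stretch) - 1
            if at_end or (aa in "KR" and stretch[i + 1] != "P"):
                peptide = stretch[start:i + 1]
                if len(peptide) >= min_length:
                    peptides.append(peptide)
                start = i + 1
    return peptides
```

Running `tryptic_peptides` over every frame returned by `six_frame_translation` for each assembled contig would give a peptide-level search space comparable in spirit to the custom databases described above; a real workflow would also need read trimming and contig assembly, which are left out here.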

Agricultural and Plant Biochemistry and Proteomics Research Group Dept of Biochemistry and Molecular Biology University of Córdoba Spain

Dept of Biology and Centre for Environmental and Marine Studies University of Aveiro Aveiro Portugal

GCRC Adaption Biotechnologies Academy of Sciences of the Czech Republic Brno Czech Republic

Plant Physiology Faculty of Biology Dept of Organisms and Systems Biology University of Oviedo Spain

000: 00000naa a2200000 a 4500

001: bmc15014518

003: CZ-PrNML

005: 20150428102652.0

007: ta

008: 150420s2014 ne f 000 0|eng||

009: AR

024 7_: $a 10.1016/j.jprot.2014.01.027 $2 doi

035 __: $a (PubMed)24508333

040 __: $a ABA008 $b cze $d ABA008 $e AACR2

041 0_: $a eng

044 __: $a ne

100 1_: $a Romero-Rodríguez, M Cristina $u Agricultural and Plant Biochemistry and Proteomics Research Group, Dept. of Biochemistry and Molecular Biology, University of Córdoba, Spain.

245 10: $a Improving the quality of protein identification in non-model species. Characterization of Quercus ilex seed and Pinus radiata needle proteomes by using SEQUEST and custom databases / $c MC. Romero-Rodríguez, J. Pascual, L. Valledor, J. Jorrín-Novo,

520 9_: $a UNLABELLED: Nowadays the most used pipeline for protein identification consists in the comparison of the MS/MS spectra to reference databases. Search algorithms compare obtained spectra to an in silico digestion of a sequence database to find exact matches. In this context, the database has a paramount importance and will determine in a great deal the number of identifications and its quality, being this especially relevant for non-model plant species. Using a single Viridiplantae database (NCBI, UniProt) and TAIR is not the best choice for non-model species since they are underrepresented in databases resulting in poor identification rates. We demonstrate how it is possible to improve the rate and quality of identifications in two orphan species, Quercus ilex and Pinus radiata, by using SEQUEST and a combination of public (Viridiplantae NCBI, UniProt) and a custom-built specific database which contained 593,294 and 455,096 peptide sequences (Quercus and Pinus, respectively). These databases were built after gathering and processing (trimming, contiging, 6-frame translation) publicly available RNA sequences, mostly ESTs and NGS reads. A total of 149 and 1533 proteins were identified from Quercus seeds and Pinus needles, representing a 3.1- or 1.5-fold increase in the number of protein identifications and scores compared to the use of a single database. Since this approach greatly improves the identification rate, and is not significantly more complicated or time consuming than other approaches, we recommend its routine use when working with non-model species. BIOLOGICAL SIGNIFICANCE: In this work we demonstrate how the construction of a custom database (DB) gathering all available RNA sequences and its use in combination with Viridiplantae public DBs (NCBI, UniProt) significantly improve protein identification when working with non-model species. 
Protein identification rate and quality is higher to those obtained in routine procedures based on using only one database (commonly Viridiplantae from NCBI), as we demonstrated analyzing Quercus seeds and Pine needles. The proposed approach based on the building of a custom database is not difficult or time consuming, so we recommend its routine use when working with non-model species. This article is part of a Special Issue entitled: Proteomics of non-model organisms.

650 _2: $a sekvence aminokyselin $7 D000595

650 _2: $a sekvence nukleotidů $7 D001483

650 12: $a databáze proteinů $7 D030562

650 _2: $a borovice $x genetika $x metabolismus $7 D028223

650 _2: $a rostlinné proteiny $x genetika $x metabolismus $7 D010940

650 _2: $a proteom $x genetika $x metabolismus $7 D020543

650 _2: $a proteomika $x metody $7 D040901

650 _2: $a dub (rod) $x genetika $x metabolismus $7 D029963

650 _2: $a semena rostlinná $x genetika $x metabolismus $7 D012639

650 _2: $a sekvenční analýza proteinů $x metody $7 D020539

650 _2: $a sekvenční analýza RNA $x metody $7 D017423

655 _2: $a dataset $7 D064886

655 _2: $a časopisecké články $7 D016428

655 _2: $a práce podpořená grantem $7 D013485

700 1_: $a Pascual, Jesús $u Plant Physiology, Faculty of Biology, Dept. of Organisms and Systems Biology, University of Oviedo, Spain.

700 1_: $a Valledor, Luis $u Dept. of Biology & Centre for Environmental and Marine Studies, University of Aveiro, Aveiro, Portugal; GCRC, Adaption Biotechnologies, Academy of Sciences of the Czech Republic, Brno, Czech Republic. Electronic address: luis@valledor.info.

700 1_: $a Jorrín-Novo, Jesús $u Agricultural and Plant Biochemistry and Proteomics Research Group, Dept. of Biochemistry and Molecular Biology, University of Córdoba, Spain. Electronic address: bf1jonoj@uco.es.

773 0_: $w MED00166847 $t Journal of proteomics $x 1876-7737 $g Roč. 105, č. - (2014), s. 85-91

856 41: $u https://pubmed.ncbi.nlm.nih.gov/24508333 $y Pubmed

910 __: $a ABA008 $b sig $c sign $y a $z 0

990 __: $a 20150420 $b ABA008

991 __: $a 20150428102955 $b ABA008

999 __: $a ok $b bmc $g 1072099 $s 897396

BAS __: $a 3

BAS __: $a PreBMC

BMC __: $a 2014 $b 105 $c - $d 85-91 $i 1876-7737 $m Journal of proteomics $n J Proteomics $x MED00166847

LZP __: $a Pubmed-20150420
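The field listing above follows a simple line layout: a three-character tag, optional two-character indicators, a colon, and then either a raw value or `$`-prefixed subfields. As a rough illustration, the sketch below parses one such line in Python. It is based only on the layout visible in this record and is not a general MARC parser; the function name `parse_field` and the returned dictionary keys are hypothetical.

```python
import re

# Tag, optional indicators, then the field body, as displayed in this record.
FIELD_RE = re.compile(r"^(?P<tag>\w{3})(?: (?P<ind>\S{2}))?: (?P<body>.*)$")
# A subfield is "$<code> <value>", ending before the next " $<code> " or at end of line.
SUBFIELD_RE = re.compile(r"\$(\w) (.*?)(?= \$\w |$)")


def parse_field(line: str) -> dict:
    """Parse one displayed field line into tag, indicators, and subfields."""
    match = FIELD_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a field line: {line!r}")
    body = match["body"]
    if body.startswith("$"):
        subfields, value = SUBFIELD_RE.findall(body), None
    else:
        # Control-style fields (e.g. 001, 005) carry a raw value without subfields.
        subfields, value = [], body
    return {
        "tag": match["tag"],
        "indicators": match["ind"],
        "subfields": subfields,
        "value": value,
    }


if __name__ == "__main__":
    print(parse_field("650 _2: $a proteomika $x metody $7 D040901"))
    print(parse_field("001: bmc15014518"))
```

Subfield values are returned verbatim, so the Czech subject headings and identifiers such as `D040901` are preserved exactly as catalogued.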
