Query: processing pipeline
INTRODUCTION: Recent advances in machine learning provide new possibilities to process and analyse observational patient data to predict patient outcomes. In this paper, we introduce a data processing pipeline for cardiogenic shock (CS) prediction from the MIMIC III database of intensive cardiac care unit patients with acute coronary syndrome. The ability to identify high-risk patients could allow pre-emptive measures to be taken and thus prevent the development of CS. METHODS: We mainly focus on techniques for the imputation of missing data by generating a pipeline for imputation and comparing the performance of various multivariate imputation algorithms, including k-nearest neighbours, two singular value decomposition (SVD)-based methods, and Multiple Imputation by Chained Equations. After imputation, we select the final subjects and variables from the imputed dataset and showcase the performance of the gradient-boosted framework that uses a tree-based classifier for cardiogenic shock prediction. RESULTS: We achieved good classification performance thanks to data cleaning and imputation (cross-validated mean area under the curve 0.805) without hyperparameter optimization. CONCLUSION: We believe our pre-processing pipeline would also prove helpful for other classification and regression experiments.
- Publication type
- Journal Article MeSH
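The cardiogenic shock record above names k-nearest neighbours as one of the compared imputation algorithms. The following is a minimal sketch of that idea in plain Python, not the authors' pipeline: the function name `knn_impute`, the distance definition (root-mean-square difference over columns observed in both rows), and the toy values are all assumptions made for illustration.

```python
import math


def knn_impute(rows, k=2):
    """Fill None entries with the mean of that column over the k nearest rows.

    Hypothetical helper: distance is the root-mean-square difference over
    the columns that are observed in both rows being compared.
    """
    def distance(a, b):
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf  # no common columns, so the rows cannot be compared
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    imputed = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Donors are other rows with column j observed and a finite distance.
            donors = [(distance(row, other), other[j])
                      for m, other in enumerate(rows)
                      if m != i and other[j] is not None]
            donors = [d for d in donors if d[0] != math.inf]
            donors.sort(key=lambda d: d[0])
            nearest = donors[:k]
            if nearest:  # if no donor exists, the gap is left as None
                imputed[i][j] = sum(v for _, v in nearest) / len(nearest)
    return imputed


# Toy data (invented values): three similar patients and one outlier.
toy = [
    [1.0, 2.0, None],
    [1.1, 2.1, 3.0],
    [0.9, 1.9, 5.0],
    [10.0, 20.0, 50.0],
]
filled = knn_impute(toy, k=2)
```

In this toy table the missing value is filled from the two similar rows and the outlier is ignored. A real comparison of imputers would typically mask known values, impute them, and score the reconstruction error.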
BACKGROUND: High-throughput bioinformatics analyses of next generation sequencing (NGS) data often require challenging pipeline optimization. The key problem is choosing appropriate tools and selecting the best parameters for optimal precision and recall. RESULTS: Here we introduce ToTem, a tool for automated pipeline optimization. ToTem is a stand-alone web application with a comprehensive graphical user interface (GUI). ToTem is written in Java and PHP with an underlying connection to a MySQL database. Its primary role is to automatically generate, execute and benchmark different variant calling pipeline settings. Our tool allows an analysis to be started from any level of the process and makes it possible to plug in almost any tool or code. To prevent over-fitting of pipeline parameters, ToTem ensures their reproducibility by using cross-validation techniques that penalize the final precision, recall and F-measure. The results are presented as interactive graphs and tables, allowing an optimal pipeline to be selected based on the user's priorities. Using ToTem, we were able to optimize somatic variant calling from ultra-deep targeted gene sequencing (TGS) data and germline variant detection in whole genome sequencing (WGS) data. CONCLUSIONS: ToTem is a tool for automated pipeline optimization which is freely available as a web application at https://totem.software.
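The ToTem abstract above describes benchmarking pipeline settings by precision, recall and F-measure, with a cross-validation penalty against over-fitting. The sketch below illustrates one way such a score could be computed. It is not ToTem's implementation: the function names are hypothetical, and the penalty used here (mean F-measure across folds minus its standard deviation) is an assumption, since the abstract does not state the formula.

```python
from statistics import mean, pstdev


def precision_recall_f1(called, truth):
    """Compare a set of called variants with a truth set."""
    called, truth = set(called), set(truth)
    tp = len(called & truth)  # true positives: called and present in truth
    precision = tp / len(called) if called else 0.0
    recall = tp / len(truth) if truth else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def penalized_f1(fold_calls, fold_truths):
    """Assumed penalty: mean F-measure over folds minus its spread."""
    scores = [precision_recall_f1(c, t)[2] for c, t in zip(fold_calls, fold_truths)]
    return mean(scores) - pstdev(scores)


def pick_setting(results):
    """results maps a setting name to (fold_calls, fold_truths)."""
    return max(results, key=lambda name: penalized_f1(*results[name]))


# Toy example with invented variant identifiers and two folds per setting.
truths = [{"v1", "v2"}, {"v1", "v2"}]
results = {
    "unstable": ([{"v1", "v2"}, {"x"}], truths),  # perfect on one fold, fails on the other
    "steady": ([{"v1"}, {"v1"}], truths),         # moderate but consistent
}
best = pick_setting(results)
```

With this penalty, a setting that performs consistently across folds is preferred over one that is excellent on one fold and poor on another.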
Cryo-electron microscopy has become established as a mature structural biology technique for elucidating the three-dimensional structure of biological macromolecules. The Coulomb potential of the sample is imaged by an electron beam, and fast semiconductor detectors produce movies of the sample under study. These movies have to be further processed by a whole pipeline of image-processing algorithms that produce the final structure of the macromolecule. In this chapter, we illustrate this whole processing pipeline, highlighting the strength of "meta algorithms," which combine several algorithms, each with a different mathematical rationale, in order to distinguish correctly estimated parameters from incorrectly estimated ones. We show how this strategy leads to superior performance of the whole pipeline as well as more confident assessments of the reconstructed structures. The "meta algorithms" strategy is common to many fields and, in particular, it has provided excellent results in bioinformatics. We illustrate this combination using the workflow engine Scipion.
- MeSH
- Algorithms * MeSH
- Cryoelectron Microscopy methods MeSH
- Macromolecular Substances ultrastructure MeSH
- Molecular Biology methods MeSH
- Image Processing, Computer-Assisted methods MeSH
- Workflow MeSH
- Computational Biology MeSH
- Single Molecule Imaging methods MeSH
- Imaging, Three-Dimensional methods MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
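The cryo-electron microscopy record above describes "meta algorithms" that combine several independent algorithms to tell correctly estimated parameters from incorrect ones. A minimal sketch of that consensus idea follows. It is an illustration only and does not reproduce any Scipion protocol: the function name, the median-based rule, the tolerance, and the example values are assumptions.

```python
from statistics import median


def consensus_estimate(estimates, tolerance):
    """Combine estimates of one parameter from several algorithms.

    Returns (value, trusted): the median estimate, and a flag that is True
    only when every algorithm agrees with the median within `tolerance`.
    """
    centre = median(estimates)
    trusted = all(abs(e - centre) <= tolerance for e in estimates)
    return centre, trusted


# Hypothetical values: three algorithms estimating the same quantity.
agreeing = consensus_estimate([1.50, 1.52, 1.49], tolerance=0.05)
disagreeing = consensus_estimate([1.50, 1.52, 2.40], tolerance=0.05)
```

When the algorithms disagree, the flag marks the estimate as unreliable so that the corresponding item can be inspected or excluded rather than silently passed downstream.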
BACKGROUND: Next generation sequencing (NGS) technology allows laboratories to investigate virome composition in clinical and environmental samples in a culture-independent way. There is a need for bioinformatic tools capable of parallel processing of virome sequencing data by exactly identical methods: this is especially important in studies of multifactorial diseases, or in parallel comparison of laboratory protocols. RESULTS: We have developed a web-based application allowing direct upload of sequences from multiple virome samples using custom parameters. The samples are then processed in parallel using an identical protocol, and can be easily reanalyzed. The pipeline performs de-novo assembly, taxonomic classification of viruses as well as sample analyses based on user-defined grouping categories. Tables of virus abundance are produced from cross-validation by remapping the sequencing reads to a union of all observed reference viruses. In addition, read sets and reports are created after processing unmapped reads against known human and bacterial ribosome references. Secured interactive results are dynamically plotted with population and diversity charts, clustered heatmaps and a sortable and searchable abundance table. CONCLUSIONS: The Vipie web application is a unique tool for multi-sample metagenomic analysis of viral data, producing searchable hits tables, interactive population maps, alpha diversity measures and clustered heatmaps that are grouped in applicable custom sample categories. Known references such as human genome and bacterial ribosomal genes are optionally removed from unmapped ('dark matter') reads. Secured results are accessible and shareable on modern browsers. Vipie is a freely available web-based tool whose code is open source.
- MeSH
- Genetic Variation MeSH
- Genomics methods MeSH
- Internet * MeSH
- Humans MeSH
- Microbiota genetics MeSH
- Software * MeSH
- Viruses genetics MeSH
- High-Throughput Nucleotide Sequencing * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
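The Vipie record above mentions alpha diversity measures computed from virus abundance tables. The abstract does not say which measures are used, so the sketch below assumes the Shannon index, a common choice, and applies it to an invented abundance table; the sample and virus names are placeholders.

```python
import math


def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over taxa with non-zero counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


# Invented abundance table: read counts per virus for each sample.
abundance = {
    "sample_1": {"virus_A": 50, "virus_B": 50},
    "sample_2": {"virus_A": 100, "virus_B": 0},
}
diversity = {name: shannon_index(list(table.values()))
             for name, table in abundance.items()}
```

A sample dominated by a single virus scores zero, while an even mixture of two viruses scores ln 2, so the index captures both richness and evenness.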
BACKGROUND: Environmental DNA and metabarcoding allow the identification of a mixture of species and launch a new era in bio- and eco-assessment. Many steps are required to obtain taxonomically assigned matrices from raw data. For most of these, a plethora of tools are available; each tool's execution parameters need to be tailored to reflect each experiment's idiosyncrasy. Adding to this complexity, the computation capacity of high-performance computing systems is frequently required for such analyses. To address the difficulties, bioinformatic pipelines need to combine state-of-the-art technologies and algorithms with an easy-to-set-up and easy-to-use framework, allowing researchers to tune each study. Software containerization technologies ease the sharing and running of software packages across operating systems; thus, they strongly facilitate pipeline development and usage. Likewise, programming languages specialized for big data pipelines incorporate features like roll-back checkpoints and on-demand partial pipeline execution. FINDINGS: PEMA is a containerized assembly of key metabarcoding analysis tools that requires low effort in setting up, running, and customizing to researchers' needs. Based on third-party tools, PEMA performs read pre-processing, (molecular) operational taxonomic unit clustering, amplicon sequence variant inference, and taxonomy assignment for 16S and 18S ribosomal RNA, as well as ITS and COI marker gene data. Owing to its simplified parameterization and checkpoint support, PEMA allows users to explore alternative algorithms for specific steps of the pipeline without the need for complete re-execution. PEMA was evaluated against both mock communities and previously published datasets and achieved results of comparable quality. CONCLUSIONS: A high-performance computing-based approach was used to develop PEMA; however, it can be used on personal computers as well. PEMA's time-efficient performance and good results will allow it to be used for accurate environmental DNA metabarcoding analysis, thus enhancing the applicability of next-generation biodiversity assessment studies.
- MeSH
- Archaea MeSH
- Bacteria MeSH
- DNA, Environmental chemistry genetics MeSH
- Fungi MeSH
- Metagenomics methods standards MeSH
- Reference Standards MeSH
- Electron Transport Complex IV genetics MeSH
- RNA, Ribosomal, 16S genetics MeSH
- RNA, Ribosomal, 18S genetics MeSH
- Plants MeSH
- Sensitivity and Specificity MeSH
- Software MeSH
- DNA Barcoding, Taxonomic methods standards MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
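The PEMA record above credits checkpoint support with letting users change one step of the pipeline without re-running everything. The sketch below illustrates that general idea with an in-memory checkpoint store. It is not PEMA's mechanism: the function `run_pipeline`, the toy steps, and their parameters are hypothetical, and changes to the raw input itself are not tracked.

```python
def run_pipeline(steps, data, checkpoints, params):
    """Run steps in order, reusing checkpointed outputs where possible.

    steps: list of (name, function(data, step_params)).
    checkpoints: dict mapping step name -> (params_used, output).
    A step is re-executed if its parameters changed or if any earlier
    step was re-executed in this run.
    """
    executed = []
    upstream_changed = False
    for name, func in steps:
        step_params = params.get(name)
        cached = checkpoints.get(name)
        if not upstream_changed and cached is not None and cached[0] == step_params:
            data = cached[1]  # roll forward from the stored checkpoint
            continue
        data = func(data, step_params)
        checkpoints[name] = (step_params, data)
        executed.append(name)
        upstream_changed = True
    return data, executed


# Hypothetical toy steps standing in for pre-processing, clustering, counting.
def trim(reads, min_len):
    return [r for r in reads if len(r) >= min_len]


def cluster(reads, prefix_len):
    return sorted({r[:prefix_len] for r in reads})


def count(clusters, _unused):
    return len(clusters)


steps = [("trim", trim), ("cluster", cluster), ("count", count)]
reads = ["ACGT", "ACGA", "TT", "TTGA"]
store = {}
first = run_pipeline(steps, reads, store, {"trim": 3, "cluster": 2, "count": None})
second = run_pipeline(steps, reads, store, {"trim": 3, "cluster": 4, "count": None})
third = run_pipeline(steps, reads, store, {"trim": 3, "cluster": 4, "count": None})
```

In the second run only the clustering parameter changes, so the trimming step is reused and only clustering and counting are executed again; the third run reuses every checkpoint.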
Geib (France) -- Dimensional Modeling: Beyond Data Processing Constraints -- A. SIGNAL AND IMAGE PROCESSING Section 5A. Matsuo -- An Integrated Environment for ECG Processing -- I. Signal Processing -- Payne: MICROPROCESSORS IN MEDICAL RECORD PROCESSING. (New Zealand) -- J.
IFIP world conference series on medical informatics Studies in health technology and informatics
vol. ; 27 cm
- MeSH
- Information Systems MeSH
- Medical Informatics MeSH
- Medicine MeSH
- Publication type
- Congress MeSH
- Collected Work MeSH
- Conspectus
- Medical sciences. Medicine
- NLK Subject fields
- medical informatics
This work presents a novel, fully automated method for retinal analysis in images acquired with a flood-illuminated adaptive optics retinal camera (AO-FIO). The proposed processing pipeline consists of several steps. First, we register single AO-FIO images into a montage image capturing a larger retinal area. The registration is performed by a combination of phase correlation and the scale-invariant feature transform method. A set of 200 AO-FIO images from 10 healthy subjects (10 images from the left eye and 10 images from the right eye of each subject) is processed into 20 montage images, which are mutually aligned according to the automatically detected fovea center. As a second step, the photoreceptors in the montage images are detected using a method based on regional maxima localization, where the detector parameters were determined with Bayesian optimization according to photoreceptors manually labeled by three evaluators. The detection assessment, based on the Dice coefficient, ranges from 0.72 to 0.8. In the next step, the corresponding density maps are generated for each of the montage images. As a final step, representative averaged photoreceptor density maps are created for the left and right eye, enabling comprehensive analysis across the montage images and a straightforward comparison with available histological data and other published studies. Our proposed method and software thus enable us to generate AO-based photoreceptor density maps for all measured locations fully automatically, which makes the method suitable for large studies, where automated approaches are urgently needed. In addition, the application MATADOR (MATlab ADaptive Optics Retinal Image Analysis), which implements the described pipeline, and the dataset with photoreceptor labels are made publicly available.
- Publication type
- Journal Article MeSH
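The retinal imaging record above assesses photoreceptor detection with the Dice coefficient. For point detections this is commonly computed as 2TP / (2TP + FP + FN) after pairing detected points with labeled ones. The sketch below shows that calculation under stated assumptions: the greedy nearest-neighbour matching, the distance threshold `max_dist`, and the coordinates are illustrative and are not taken from MATADOR.

```python
import math


def dice_for_detections(detected, labeled, max_dist=2.0):
    """Dice coefficient for point detections against manual labels.

    Each detection is greedily matched to the closest still-unmatched label
    within max_dist (an assumed matching rule); matches count as true positives.
    """
    unmatched = list(labeled)
    tp = 0
    for point in detected:
        best, best_dist = None, max_dist
        for idx, ref in enumerate(unmatched):
            d = math.dist(point, ref)
            if d <= best_dist:
                best, best_dist = idx, d
        if best is not None:
            unmatched.pop(best)  # each label can be matched only once
            tp += 1
    fp = len(detected) - tp   # detections with no label nearby
    fn = len(unmatched)       # labels that no detection reached
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 1.0


# Invented pixel coordinates: two detections hit labels, one is spurious,
# and two labels are missed.
detected = [(0, 0), (10, 10), (50, 50)]
labeled = [(0, 1), (10, 11), (30, 30), (40, 40)]
score = dice_for_detections(detected, labeled)
```

With several evaluators, as in the abstract, the same function could be applied once per evaluator's labels, which would naturally yield a range of scores.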
The research and development of advanced therapy medicinal products (ATMPs) has been active in Europe and worldwide during recent years. Yet, the number of licensed products remains low. The main expected legal change in the near future in the European Union (EU) concerns the regulation on clinical trials (536/2014), which will come into force in 2018. With this new framework, a more harmonized and swift process for approval of clinical trials is anticipated, which is expected to support the entry of new innovations into the EU market. A survey on ATMPs in clinical trials during 2010-2015 in the EU was conducted in order to study the trends of ATMP development since the earlier survey published in 2012. According to the results, the number of clinical trials using ATMPs is slowly increasing in the EU. Yet, the focus is still on early development, and the projects are mainly carried out by small and medium-sized enterprises, academia, and hospitals. Oncology is the main area of clinical development. Yet, the balance between cell-based products and gene therapy medicinal products in this area may change in the future due to the new T-cell technologies. Many limitations and challenges are identified for ATMP development, calling for proportionate regulatory requirements. On the other hand, in such a novel field, developers should actively consider possible constraints and engage with authorities to look for solutions. This article provides up-to-date information on forthcoming regulatory improvements and discusses the main challenges hampering the commercialization of ATMPs in the EU.
- MeSH
- Biomedical Research economics legislation & jurisprudence standards MeSH
- European Union MeSH
- Drug Industry economics legislation & jurisprudence standards MeSH
- Clinical Trials as Topic economics legislation & jurisprudence standards MeSH
- Technology Transfer * MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
SUMMARY: Here we introduce a Fiji plugin utilizing the HPC-as-a-Service concept, significantly mitigating the challenges life scientists face when delegating complex data-intensive processing workflows to HPC clusters. We demonstrate on a common Selective Plane Illumination Microscopy image processing task that execution of a Fiji workflow on a remote supercomputer leads to improved turnaround time despite the data transfer overhead. The plugin allows the end users to conveniently transfer image data to remote HPC resources, manage pipeline jobs and visualize processed results directly from the Fiji graphical user interface. AVAILABILITY AND IMPLEMENTATION: The code is distributed free and open source under the MIT license. Source code: https://github.com/fiji-hpc/hpc-workflow-manager/, documentation: https://imagej.net/SPIM_Workflow_Manager_For_HPC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
SECTOR IN THE CZECH REPUBLIC -- 3.1 Oil Infrastructure in the Czech Republic -- 3.1.1 Oil Pipeline Infrastructure -- 5.2 The Ingolstadt-Kralupy-Litvínov Pipeline (IKL) -- 5.3 The Adria Pipeline -- 5.4 The Potential Bratislava-Schwechat Pipeline (BSP) and Adria-Wien Pipeline (AWP) -- 5.5 The Potential Odessa-Brody-Adamowo-Plock-Gdansk Pipeline -- 5.6 The Potential Spergau-Litvinov Pipeline -- 5.7 Lobau-Bratislava Waterway -- CHAPTER 6: RESULTS -- CHAPTER 7: CONCLUSION
1st electronic edition 1 online resource (204 pages)
Since 2007, reports have repeatedly surfaced that Russia is considering shutting down the Družba oil pipeline. The Czech and Slovak Republics perceive this as a significant threat. This book evaluates the available infrastructure alternatives and offers guidance for a future solution to the problem. The aim is to analyse oil pipeline infrastructure options as alternatives to the primary supply route for the Czech and Slovak Republics. There are six such options: the Ingolstadt-Kralupy-Litvínov pipeline, the Adria pipeline, the potential Bratislava-Schwechat Pipeline and Adria-Wien Pipeline, the potential Odessa-Brody-Adamowo-Płock-Gdańsk pipeline, the potential Spergau-Litvínov pipeline, and the Lobau-Bratislava waterway. The research results show that the Czech and Slovak Republics have common interests in the oil sector, which they can translate into a joint approach to pursuing them.