GEO Uploader: Simplifying the data deposition in the GEO repository
Format: In person
Moderator(s): Madelaine Gogol
Authors List: Show
- Ronald Domi, University of Zurich, Switzerland
- Falko Noé, ETH Zurich / University of Zurich, Switzerland
- Peter Leary, ETH Zurich / University of Zurich, Switzerland
- Hubert Rehrauer, ETH Zurich / University of Zurich, Switzerland
Presentation Overview: Show
Introduction
Making data FAIR is a key step in every research project. For NGS data there are the GEO and ENA repositories that provide long term storage and open access, and are widely adopted in the research community. Transferring the data and compiling the meta-information appropriately is however still a manual activity which may be cumbersome for massive NGS data.
Methods
We implemented a Python-based web application that performs the data upload for users and compiles the meta-information in an appropriate way. The GEO Uploader can be run standalone but is in our environment tightly integrated with our SUSHI web framework for reproducible, web-based analysis of sequencing data.
Results
The GEO Uploader is running at our center at https://geo-uploader.fgcz.uzh.ch/ and so far already close to 50 datasets have been uploaded to GEO. The uploader collects the files, generates MD5 sums, transfers the data and compiles the Excel table that is needed to provide the meta-information. It fills protocol information automatically based on the data and let’s users enter other information through a convenient web interface. It currently supports bulk RNA-seq as well as single cell RNA-seq data.
Discussion
Our GEO Uploader contributes to the community-wide adoption of Open Resarch Data (ORD) best practices. The GEO Uploader invites researchers to make data available early in the research process and it simplifies and speeds up this step.
Enhancing Bioinformatics Workflows with Analytical Visualization Tools
Confirmed Presenter: Carlos Prieto, Bioinformatics Service, Nucleus. University of Salamanca., Spain
Format: In person
Moderator(s): Madelaine Gogol
Authors List: Show
- Carlos Prieto, Bioinformatics Service, Nucleus. University of Salamanca., Spain
- David Barrios, Bioinformatics Service, Nucleus. University of Salamanca., Spain
Presentation Overview: Show
Current front end development technologies have enabled the development of new visual analytics tools. These methodologies allow data to be visualized in a web browser in an interactive and dynamic way. The development of new visualization tools is essential for the effective exploration and interpretation of datasets and results produced by bioinformatics analysis techniques.
This work presents programming methodologies and analytical visualization solutions that have been applied applied to the analysis of high-throughput sequencing data. Their development has been carried out using new web visualization technologies and the creation of a development architecture called LAMPR (Linux, Apache, MySQL, PHP, R). The following bioinformatics tools will be presented:
- RJSplot: A collection of 17 interactive plots implemented in R.
- D3GB: An interactive genome browser.
- Looking4clusters: A tool for the interactive visualization of single-cell data.
- Rvisdiff: Analytical visualization of differential expression results.
- RaNA-Seq: A web-based platform for the analysis and visualization of RNA-Seq data.
- MutationMinning: A self-analytical interface for the exploration of DNA resequencing results.
The use of interactive and dynamic visualization tools enhances the interpretation of complex datasets and enables study designers or wet lab members to work toward a deeper understanding of their data.
Competency framework profiles to reflect career progression within bioinformatics core facility scientists
Format: In person
Moderator(s): Madelaine Gogol
Authors List: Show
- Patricia Carvajal-López, EMBL's European Bioinformatics Institute, United Kingdom
- Marta Lloret-Linares, EMBL's European Bioinformatics Institute, United Kingdom
- Cath Brooksbank, EMBL's European Bioinformatics Institute, United Kingdom
Presentation Overview: Show
There is an expanding need for specialised services from Bioinformatics Core Facilities (BCF). Providing services for these infrastructures requires highly trained specialists; however, their ill- defined career pathways and their highly specialised skill set often hinder their efforts to progress in their professions.
To address this challenge, members of the ISCB’s Bioinfo-Core group, the Curriculum Task Force of the ISCB Education Committee, and other interested individuals joined forces at the 2023 ISMB Bioinfo-Core meeting to create the ‘Bioinformatics Core Facility Scientists Competencies Taskforce’ (https://sites.google.com/ebi.ac.uk/bioinfocore-competencies). The taskforce worked to provide a benchmark for reflecting the knowledge, skills and attitudes required by professionals in BCFs, and to provide a potential template for career progression for BCF scientists.
This benchmark was developed as an extension of the ISCB Competency Framework (https://competency.ebi.ac.uk), which defines a ‘minimum standard’ for a generic, mid-career BCF scientist (and for several other distinct career profiles). The outcome of this work was the addition of six BCF scientist-focused competencies (project management, people management, collaborator engagement, users and service, training, and leadership) to the thirteen that already exist. We also created four different professional profiles for BCF scientists, outlining a potential transition from entry level to a managerial role.
The development of a well-defined, competency-based career pathway, along with training for this community, is essential to support career progression of BCF specialists who, in return, enable research and development within the life sciences.