Training courses: School of Python for genomics Massimiliano Orsini. Intermediate module: Analysing your genomic data with Python
Course Description
The use of Next-Generation Sequencing (NGS) has exponentially increased in the last two decades, both in the field of research and diagnostics. This technology allows obtaining millions of sequences (‘reads’) in a single experiment, making it possible to reconstruct complete genomes of pathogens relevant to public health in just a few hours.
By that motivation, the Società Italiana di Diagnostica di Laboratorio Veterinaria (SIDiLV) and the Italian Infrastructure of Bioinformatics (IIB/ELIXIR Italy) organised the School of Python for genomics at the Italian National Institute of Health (ISS).
The School aims to provide the necessary skills to utilise the Python language both for the use and development of software and for genomics data analysis in general.
The whole School consists of 3 consequential modules:
- Basic module: Introduction to Python programming (Nov 2022)
- Intermediate module: Analysing your genomic data with Python (Jan-Feb 2024)
- Hackathon module: Python programming for developers
The main objective of this second Module titled ‘Intermediate Module: Analysing your genomics data with Python’ will be to characterise a population of Escherichia coli strains through the analysis of Whole Genome Sequencing (WGS) data, identifying any groups of isolates that share epidemiological and/or genomic characteristics and the presence of specific virulence traits.
This module aims to achieve the following objectives:
- identify the distribution of serogroups of isolates
- analyse the presence and distribution of antibiotic-resistance genes
- examine the presence and distribution of virulence genes
- perform isolate typing (using methods such as MLST and cgMLST) and identify any correlations.
The course involves the development of a Python pipeline capable of processing raw data obtained from Illumina and/or Ion Torrent NGS.
Important Dates
- Deadline for applications: 9 January, 2024
- School date: 30 January - 2 February, 2024
Venue
Sapienza University of Rome
P.le Aldo Moro, 5
00195 Rome, Italy
Fee
No registration fee is required, but those who are selected will be expected to pay the SIDiLV membership fee for the year 2024 afterward (Full membership fee: 50 euros; Reduced membership fee for individuals under 35 years of age: 30 euros).
Participants are expected to pay their travel, meals, and hotel costs (if any).
Registration
A maximum of 16 candidates will be selected based on a first-come, first-served basis, and the pre-requisites requested for attending the course, as emerging from the application form.
Notifications of acceptance will be sent by January 10th.
Instructors
- Luca De Sabato - Dep. of Food Safety, Nutrition and Veterinary Public Health, National Institute of Health (SANV - ISS), Italy
- Arnold Knijn - Dep. of Food Safety, Nutrition and Veterinary Public Health, National Institute of Health (SANV - ISS), Italy
- Loredana Le Pera - Core Facilities Technical-Scientific Service, National Institute of Health (FAST - ISS), Italy
- Valeria Michelacci - Dep. of Food Safety, Nutrition and Veterinary Public Health, National Institute of Health (SANV - ISS), Italy
- Allegra Via - Sapienza University of Rome, Italy
Helpers
- Gianluca Frustagli - Core Facilities Technical-Scientific Service, National Institute of Health (FAST - ISS), Italy
- Chiara Pacelli - Institute for Biomedical Technologies of National Research Council (CNR-ITB), Italy
- Irene Ruspantini - Core Facilities Technical-Scientific Service, National Institute of Health (FAST - ISS), Italy
Prerequisites
This Intermediate Module is designed as a follow-up to the Basic Module of the School of Python, hence it assumes a basic understanding of programming and prior programming experience in Python as prerequisites. This Module will primarily focus on practical applications rather than theoretical concepts.
Target audience
Research scientists at any stage of their career who are using (or planning to use) genomic data in their research activities are welcome to attend the workshop.
Learning Outcomes
At the end of the course, the student will be able to:
- critically assess the quality of NGS data
- develop a complete pipeline for genomic analysis (starting from raw data to obtain final results).
In general, after completing the entire module, the student will have a greater awareness of both the use of Python and the impact that genomic analysis can have in a professional and social context.
Organisers
SIDiLV
ELIXIR Italy
Stefano Morabito - SIDiLV/ELIXIR Italy/National Institute of Health (ISS), Italy,
Valeria Michelacci - SIDiLV/National Institute of Health (ISS), Italy,
Loredana Le Pera - ELIXIR Italy (Training Platform)/National Institute of Health (ISS), Italy,
Allegra Via - ELIXIR Italy (Training Platform)/Sapienza Univ. of Rome, Italy
Contact
For all kinds of queries, contact us at elixir.ita.training@gmail.com
Programme
Tuesday - 30 January |
|
13:00 - 14:00 | Setup |
14:00 - 14:15 | Welcome |
14:15 - 14:45 | Introducing the topic |
14:45 - 16:30 | Python Recap (Exercises) |
16:30 - 17:00 | Break |
17:00 - 17:30 | Genomics: Introduction to the experiment and data (Lecture) |
Wednesday - 31 January |
|
09:00 - 09:30 | Warm-up |
09:30 - 10:15 | Raw data Quality Control |
10:15 - 11:00 | Python: Introduction to the Pipeline (Lecture) |
11:00 - 11:30 | Break |
11:30 - 12:30 | Python: Developing the Pipeline (Hands-on) |
12:30 - 14:00 | Lunch |
14:00 - 15:30 | Python: Developing the Pipeline (Hands-on) |
15:30 - 16:00 | Break |
16:00 - 17:00 | Python: Developing the Pipeline (Hands-on) |
17:00 - 17:30 | Wrap-up |
Thursday - 1 February |
|
09:00 - 09:30 | Warm-up |
09:30 - 10:30 | Python: Developing the Pipeline (Lecture + Hands-on) |
10:30 - 11:00 | Break |
11:00 - 12:30 | Python: Developing the Pipeline (Hands-on) |
12:30 - 14:00 | Lunch |
14:00 - 14:30 | Python: Developing the Pipeline (Hands-on) |
14:30 - 15:30 | Python: Developing the Pipeline (Hands-on) |
15:30 - 16:00 | Break |
16:00 - 17:00 | Python: Developing the Pipeline (Hands-on) |
17:00 - 17:30 | Wrap-up |
Friday - 2 February |
|
09:00 - 09:30 | Warm-up |
09:30 - 10:30 | Python: Graphs production (Lecture + Hands-on) |
10:30 - 11:00 | Break |
11:00 - 12:30 | Python: Graphs production (Hands-on) |
12:30 - 13:30 | Lunch |
13:30 - 15:00 | Python: Graphs production (Hands-on) |
15:00 - 15:30 | Critical Analysis of Results |
15:30 - 16:00 | Final Discussion: some examples |
16:00 - 16:30 | Introduction to Python classes (Special Lecture) |
16:30 - 17:00 | Conclusion and Feedback questionnaire |