Principles Of Computer Science 22022
Second Year / First Semester Course L-2 - 1049261
(A. A. 2022/2023)
Syllabus
The course aim to introduce computational thinking and the algorithmic approach to solving problems correctly and efficiently. Algorithms are ubiquitous in bioinformatics and are often at the interface of computer science and biology. Well established algorithmic techniques will be studied as well as ways to encode them in a computer program using python.
We will introduce the algorithmic approach and the theory of algorithms for studying correctness and efficiency, understanding what makes a good algorithm and how to classify them.
We will study characteristic algorithmic techniques and the relate-d computational ideas that are relevant to the field of biology and how to select the most suitable to solve a given task. Topics covered include
- Searching algorithms
- Divide-and-Conquer algorithms
- Clustering and Tree-based algorithms
We will work with Python and how to write a computer program encoding a given algorithm. We will work with Amazon's AWS and how to use cloud resources to efficiently execute our python programs on large datasets.
Location
All classes take place in Classroom Psicologia I, Fisiologia Generale e Antropologia Farmacia e Medicina (CU026, E01PSIL101)
Time Schedule
- Monday 16:00 - 19:00
- Thursday 14:00 - 16:00
Contact & Discussions
A slack channel is available at the following URL: https://pcs2022-workspace.slack.com
ASSIGNMENTS
A total of five assignments will be handed over. These assignments are done by each student individually. Clearly you should discuss with other students of the course about the assignments. However, you must understand well your solutions and the final writeup must be yours and written in isolation. In addition, even though you may discuss about how you could implement an algorithm, what type of libraries to use, and so on, the final code must be yours. You may also consult the internet for information, as long as it does not reveal the solution. If a question asks you to design and implement an algorithm for a problem, it's fine if you find information about how to resolve a problem with character encoding, for example, but it is not fine if you search for the code or the algorithm for the problem you are being asked. For the projects, you can talk with other students of the course about questions on the programming language, libraries, some API issue, and so on, but both the solutions and the programming must be yours. If we find out that you have violated the policy and you have copied in any way you will automatically fail. If you have any doubts about whether something is allowed or not, ask the instructor.
- 1st Assignment
Deadline: 24th October 2022
- 2nd Assignment
Deadline: 7th November 2022
- 3rd Assignment
Deadline: 21st November 2022
- 4th Assignment
Deadline: 12th December 2022
- 5th Assignment
Deadline: 16th January 2023
Lecture Material
- Lecture 1: Monday, October 3, 2022. Lecture Slides in PDF
- Lecture 2: Thursday, October 6, 2022. Lecture Slides in PDF
- Lecture 3: Monday, October 10, 2022. Lecture Slides in PDF
- Lecture 4: Thursday, October 13, 2022. Lecture Slides in PDF
- Lecture 5: Monday, October 17, 2022. Lecture Slides in PDF
- Recursive Functions
- Python Programming: An Introduction to Computer Science (Third Edition) by John M. Zelle
- Chapter 13 Algorithm Design & Recursion
- Sample Programs
- Python Timer Functions: Three Ways to Monitor Your Code
- iPython Magic Commands
- Big O Notation and Time Complexity
- Lecture 6: Thursday, October 20, 2022. Lecture Slides in PDF
- Chapter 6: Dynamic Programming, Problem 6.1: Equivalent Words Problem
- Second Assignment
- Lecture 7: Monday, October 24, 2022. Lecture Slides in PDF
- Lecture 8: Thursday, October 27, 2022. Lecture Slides in PDF
- Lecture 9: Thursday, November 3, 2022. Lecture Slides in PDF
- Lecture 10: Monday, November 7, 2022. Lecture Slides in PDF
- Lecture 11: Thursday, November 10, 2022. Invited Lecture by Prof. Mohamed Elhadi Rahmani, Department of Computer Science, University of Dr.MOULAY TAHAR - Saida - Algeria, on Basics of Machine Learning and Neural Networks.
- Lecture 12: Monday, November 14, 2022. Lecture Slides in PDF
- Lecture 13: Thursday, November 17, 2022. Lecture Slides in PDF
- Lecture 14: Monday, November 21, 2022.
- Hierarchical Clustering in scikit learn
- DBSCAN in scikit learn
- Fourth Assignment
- Lecture 15: Thursday, November 24, 2022. Lecture Slides in PDF
- Lecture 16: Monday, November 28, 2022. Lecture Slides in PDF
- Lecture 17: Thursday, December 1, 2022.
- Lecture 18: Monday, December 5, 2022. Lecture Slides in PDF
- Lecture 19: Thursday, December 8, 2022.
- Lecture 20: Monday, December 12, 2022. Lecture Slides in PDF
- K-mer counting using PySpark
- Fifth Assignment
- Lecture 21: Thursday, December 15, 2022.
- Lecture 22: Monday, December 19, 2022.
- Lecture 23: Thursday, December 22, 2022.
Coding Material
The material related to python that was presented in class is available from as an open-source repository in GitHub.
References
- NEIL C. JONES AND PAVEL A. PEVZNER: An Introduction to Bioinformatics Algorithms. A Bradford Book, The MIT Press, Cambridge, Massachusetts, London, England, 2004.
- JOHN M. ZELLE: Python Programming: An Introduction to Computer Science (Third Edition)
- Jeff Chang, Brad Chapman, Iddo Friedberg, Thomas Hamelryck, Michiel de Hoon, Peter Cock, Tiago Antao, Eric Talevich, Bartek WilczyĆski: Biopython Tutorial and Cookbook