Giter Site home page Giter Site logo

uis-dat640-fall2020's Introduction

Information Retrieval and Text Mining course (DAT640), University of Stavanger, fall 2020

Course format

The course follows a hybrid format, where lecture videos are provided online and classroom time is used for discussion, exercises, and working on assignments.

This course involves self-study (which can be completed online): You're expected to watch the lecture videos, read the corresponding book chapters/sections listed on the last slide of each lecture deck, as well as complete the exercises on GitHub.

There is also a physical component which is not obligatory, but highly recommended for an optimal learning experience. This involves discussion and exercises in a regular classroom setting.

There is a double lecture on Mondays, 14:15-16:00 (without dividing into A/B groups). The Tuesday 14:15-16:00 slot is kept for open office hours (in KE E-433). Note that open office hours are meant for questions regarding the lecture material and/or exercises. Issues related to the assignments should be addressed at lab sessions on Wednesdays.

The semester is divided into lecture and group project work periods, with an (optional) trial exam in between.

Lecture period (weeks 35-42)

  • Mondays and Tuesdays are classes for discussion and exercises, led by Krisztian Balog (KB).
    • The respective video lectures are made available either before or after the class (depending on the topic). If the video is made available before the class, you are expected to watch it, in order to be able to meaningfully participate in the discussion and undertake the exercises.
  • Wednesdays are labs for getting help on the obligatory assignments, led by Trond Linjordet (TL).
    • Assignments are to be solved individually. They constitute 50% the project work.

Trial exam (week 43)

A trial exam will be made available mirroring the same setup that will be used at the final exam. The exam will not be corrected, but the answer key will be shared.

Group project work (weeks 44-46)

Students will need to complete a project in groups of 2-3, and write a report that will be graded.

There will be a pool of options to select a project from. During the project period, each group can request a 15mins dedicated weekly discussion slot with the lecturer during the class hours (Mon/Tue), and can get feedback on the draft report from the teaching assistant during lab hours (Wed). Both in-person and remote (Zoom) options will be available.

Lectures and material

Date Topic Lectures Exercises
Aug 24 L1: Welcome and introduction slides exercises, solutions
Aug 25 L2: Text classification slides, video lecture exercises, solutions
Aug 31 L3: Text preprocessing slides, video lecture exercises, solutions
Sep 1 L4: Text classification evaluation slides, video lecture exercises, solutions
Sep 7 L5: Text classification: Naive Bayes slides, video lecture exercises
L6: Text clustering (slides to be added) exercises
Sep 21 L7: Search engine architecture slides, video lecture exercises, solutions
L8: Indexing and query processing slides, video lecture
L9: Retrieval evaluation slides, video lecture
Sep 28 L10: Retrieval models slides, video lecture exercises, solutions
L11: Query modeling slides, video lecture
L12: Web search slides, video lecture
Oct 5 L13: Semantic search: Entity-oriented search slides, † exercises, solutions
L14: Semantic search: Entity retrieval slides, †
L15: Semantic search: Entity linking slides, †
Elasticsearch code
Oct 12 L16: Learning to rank slides, video lecture, code exercises, solutions
L17: Neural IR (invited) slides, video lecture
L18: Table retrieval (invited) slides, video lecture

Lectures L13-15 closely follow book chapters 1, 2, 3 and 5 of the Entity-Oriented Search book (available open access), therefore, no video lectures are made for them. Check the chapters for a detailed narrative.

Grading

The overall grade comes from two components:

  • Project work (40%), of which
  • Written exam (60%)

Note that the project grade needs to be >F in order to pass the course.

Contact and getting help

  • For all course-related matters, the primary contact email is [email protected]
  • Wednesday labs are for working on the assignments. This is the time to get help!
  • If you need to talk to the lecturer, make an appointment via email. No drop-ins unannounced!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.