Giter Site home page Giter Site logo

atp_kurssi's Introduction

ATP_kurssi

This page includes all the materials for the course KKLT0030 Automatic text processing 5 credits.

The course Moodle page has private materials, such as possible recordings and announcements: https://moodle.utu.fi/course/view.php?id=29596

Mon Oct 23

  • Getting started
  • Notebook 1
  • Commands
    • Getting data and printing stuff: wget, echo
    • Printing files: cat, head, tail
    • Copying, renaming, removing: cp, rm, mv
    • Others: wc -w, ls

Thur Oct 26

  • Notebook2
  • Commands: egrep, sort, uniq
  • Options
    • egrep -v, -i, -w, -c, -B, -A
    • head -n, tail -n
    • wc -l, -w
    • uniq -c, sort -r, -n
  • Pipes, especially frequency counts
    • sort | uniq -c | sort -rn

Mon Oct 30

  • Notebook3 exercises

Thur Nov 2

  • Notebook4
  • Git clone for cloning Github reports
  • Gzipped files using gzip and zcat
  • Changing characters using tr
    • Combining tr to a frequency list pipeline
    • Using tr to normalize
  • Regular expressions

Mon Nov 6

  • Notebook 5 exercies

Thur Nov 9

  • Notebook 6
  • Dependency syntax analysis pipeline
  • Sentence + token segmentation, lemmatisation, POS, dependencies
  • conllu format
  • Universal dependencies treebanks
  • Trankit parser

Mon Nov 13

  • Notebook 7
  • Running python scripts

Thur Nov 16

  • Notebook 8
  • Working on the server (Note that the exam will be on server!)

Mon Nov 20

  • Notebook 8 cont'd
  • Scripts

Thur Nov 23

  • Notebook 9

Mon Nov 27

  • Notebook 9

Thur Nov 30

  • Notebook 10
  • For loops

Mon Dec 4

Thur Dec 7

  • Exam, option 1
  • 14.00-16.00 (TBA)

Thur Dec 14

  • Exam, option 2
  • 14.00-16.00 (TBA)

atp_kurssi's People

Contributors

mavela avatar selcene avatar jmnybl avatar shellstroem avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.