Giter Site home page Giter Site logo

suderoy / prereq-iaai-19 Goto Github PK

View Code? Open in Web Editor NEW
26.0 3.0 10.0 5.16 MB

Inferring Concept Prerequisite Relations from Online Educational Resources (IAAI-19)

Home Page: https://arxiv.org/abs/1811.12640

License: GNU General Public License v3.0

Python 33.01% C 65.34% Shell 0.35% Makefile 1.29%
siamese-network university-course-dataset generative-model mooc latent-dirichlet-allocation prerequisites online-education-learning

prereq-iaai-19's Introduction

This is the source code for the paper "Inferring Concept Prerequisite Relations from Online Educational Resources", Sudeshna Roy, Meghana Madhyastha, Sheril Lawrence, Vaibhav Rajan. 31st AAAI Conference on Innovative Applications of Artificial Intelligence (IAAI-19). If you use any part of this code cite this paper.

First run the pairwise Link LDA

to learn the /beta matrix (concept vectors). This code is modfied from original LDA code.

cd "pairwise link lda"
make

./lda est 0.01 100 settings.txt "../datasets/NPTEL MOOC Dataset/bow_feature.txt" "../datasets/NPTEL MOOC Dataset/prerequisites.link" random "../datasets/NPTEL MOOC Dataset/k100_a0.01"

Train the siamese network

It does a 10 fold cross-validation on the data.

cd "siamese network"
python siamese_fc_relu.py  "NPTEL MOOC Dataset" "../datasets/NPTEL MOOC Dataset/k100_a0.01 100"

You may change all the above occurence of "NPTEL MOOC Dataset" to "University Course Dataset" to run on University Course Dataset.

To run on your own dataset:

  1. Have the concept vocabulary in a text file as "vocab.txt"

  2. Have all the files in the raw data ready. The files are:

  • cs_courses.csv: These are CS-related course information. Each line is formatted as "<Course_id>,<Course_description>". Note the course titles are located at the begining of the description.

  • cs_edges.csv: There are course prerequisite information. Each line "<course_1>,<course_2>" represents <course_2> is a prerequisite for <course_1>.

  • cs_preqs.csv: These are concept prerequisite pairs. Each line "<Concept_A>,<Concept_B>" represents the prerequisite relationship.

  1. Mention the paths of "cs_preqs.csv" and "vocab.txt" in "data preprocessing/preq_preparation.py" and run which writes the 'concept_vocab.pkl' in the same directory.

    python preq_preparation.py
    

    Add the paths of "cs_course.csv" and "concept_vocab.pkl" in "data preprocessing/create_bow_features.py". Then run

    python create_bow_features.py
    
  2. Run the pairwise Link LDA with correct paths as explained earlier.

  3. Have the train-test splits ready and mention the data paths in the "siamese network/data_processing/siamese_data_train_test.py

  4. Run the siamese network as explained above.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.