I-TUTOR Maps Exploring the theoretical background

Arianna Pipitone, Vincenzo Cannella, and Roberto Pirrone
Department of Chemical, Management, Computer, and Mechanical Engineering (DICGIM)

I-TUTOR overview
Intelligent Tutoring for Lifelong Learning
An AI-enriched VLE, which supports:
- Monitoring
- Instructional design
- Self-regulation in students

I-TUTOR overview - The plugin
I-TUTOR plugin functionalities:
- Maps
- Monitoring
- Profiling
- Alerting
I-TUTOR supports multilingual Moodle courses.

I-TUTOR overview - Users
Three kinds of users with different needs:
- Instructional designer
- Tutor
- Student

Users - Tutor
Monitoring students
- Single student
- Entire class
Student and class activities
- Time spent on studying
- Contents produced by the student/class
- Social activities of the student/class

Users - Tutor
Monitoring (students vs. time)
- Single student, at a given time: disaggregate data analysis
- Single student, over a period: disaggregate diachronic data analysis
- Entire course, at a given time: aggregate synchronic data analysis
- Entire course, over a period: aggregate diachronic data analysis

Users - Student
- Self-monitoring
  - Through time: advances throughout the course
- Proper access to contents
  - Referencing materials to the topics of the course
- Self-regulation

Users - Instructional designer
- Knowledge domain authoring
- Course authoring
- Overview of the contents of a course
  - Topics of the course
  - Relationships between topics
    - Semantic similarity
    - Pre-requisite (timing of the contents)

Relevant Processes in I-TUTOR
- Authoring
  - Domain representation
  - Semantic technologies
  - Visualization
- Information retrieval
  - Navigation
  - Accessing materials
  - Visualization
  - Semantic technologies
- (Self-)assessment
  - Visualization

Domain representation
How to represent knowledge about a domain (a set of facts and events)
- Explicit representation
  - Ontologies
  - Conceptual maps (hypertext)
- Implicit representation through verbose texts
  - Definitions
  - Learning materials

Explicit Domain Representation
Pros
- Based on a formal description of domain facts and events
Cons
- Requires meta-knowledge about the kind of representation (ontologies, ERD, general taxonomies)
- High complexity

Implicit Domain Representation
Pros
- Direct use of texts
- Verbose
- Not structured
- Easy to implement
- No technical skills needed
Cons
- Needs intensive information analysis techniques

Information Retrieval and Assessment
Many facets to be managed:
- Content
- Course
- Student
- Class
- Studying activities
- Social activities
and the relations between the above-mentioned facets

Semantic Technologies
- Symbolic analysis and linguistic approaches for NLP
  - Semantic parsing
  - Named entity recognition
- Sub-symbolic analysis
  - Machine learning and statistical evaluation
- Explicit vs. latent semantics

Course Visualization
- Overview of the course
- Topics
- Semantic relations between topics
  - Similarity
  - Adjacency
  - Overlapping
  - Hierarchy
- Topics sequencing

Content Visualization
- Different kinds of contents
  - Learning materials
  - Contents produced by the students
    - Homework
    - Social activities
- Topic-based classification
- Distribution over topics

Activities Visualization
- Studying activities
  - Number of documents accessed and/or produced by the user
- Social activities
  - Number of discussions inside the social media and their relation to the course topics

The Proposed Solution
- A sub-symbolic statistical method for classifying the concepts and didactical documents of a course
- Creation of a semantic space representing the course domain, where data analysis can be performed
- New documents and/or activities can be projected into the space, or a new classification can be made
- Graphic rendering of the space through a ZUI (Zooming User Interface) map

I-TUTOR Maps pipeline
Documental Corpora -> Preprocessing -> TF-IDF -> LSA -> SOM -> Parametric Clustering -> Maps Base

Documental Corpora
- Weighted keywords
  - Hidden database and keywords definition
- Didactical documents
  - Teacher learning materials
  - Documents by students
    - Social (forum, chat)
    - Didactical (test answers, notes, and so on)

Preprocessing
- Stemming
- Stop-words removal
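The two preprocessing steps can be sketched as follows. This is only an illustration, not the I-TUTOR implementation: the stop-word list and the suffix-stripping `crude_stem` function are hypothetical English-only stand-ins, whereas a multilingual course would need a proper per-language stemmer and stop-word list.

```python
import re

# Illustrative, English-only stop-word list (an assumption, not from the slides).
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "are"}

# Crude suffix list standing in for a real stemming algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def crude_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Lowercase, tokenize, drop stop-words, then stem the remaining tokens."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [crude_stem(t) for t in tokens if t not in STOP_WORDS]


print(preprocess("The students are reading the maps"))
# prints ['student', 'read', 'map']
```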

TF-IDF
TF-IDF is a numerical statistic that reflects how important a word is to a document in a collection of documents (corpus). It is computed from two numbers, the term frequency and the inverse document frequency:

$$\mathrm{tf}(t,d) = n_{t,d}, \qquad \mathrm{idf}(t) = \log\frac{N}{|\{d' : t \in d'\}|}$$

where $n_{t,d}$ is the number of occurrences of term $t$ in the document $d$ and $N$ is the number of documents. Finally,

$$\mathrm{tfidf}(t,d) = \mathrm{tf}(t,d)\cdot\mathrm{idf}(t)$$
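A minimal sketch of the weighting, assuming the common formulation in which the raw term count is multiplied by the logarithm of the number of documents divided by the number of documents containing the term. The slides do not say which TF-IDF variant I-TUTOR uses, and the function name `tf_idf` and the toy corpus are illustrative.

```python
import math
from collections import Counter


def tf_idf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)  # raw term counts in this document
        weights.append(
            {term: count * math.log(n_docs / df[term]) for term, count in counts.items()}
        )
    return weights


docs = [["map", "topic", "map"], ["topic", "student"], ["student", "forum"]]
for doc_weights in tf_idf(docs):
    print(doc_weights)
```

A term that occurs in every document gets weight zero under this formulation, which is why frequent but uninformative words contribute little to the semantic space.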

Latent Semantic Analysis
- LSA analyzes the relationships between a set of documents and the terms they contain.
- LSA produces a set of concepts related to the documents and terms.
- LSA assumes that words that are close in meaning will occur in similar pieces of text.

LSA Occurrence Matrix
- The LSA occurrence matrix describes the occurrences of terms in documents.
- It is a sparse matrix whose rows correspond to terms and whose columns correspond to documents.
- We use TF-IDF for weighting the elements of the matrix.

LSA Decomposition
To reduce the dimension of the LSA matrix, the Singular Value Decomposition (SVD) is applied.
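The dimensionality reduction can be sketched with NumPy as a truncated SVD of a term-by-document matrix. The rank `k`, the toy matrix, and the `fold_in` helper (one common way to project a new document into an existing space) are assumptions made for illustration; the slides do not describe how I-TUTOR performs the projection.

```python
import numpy as np


def lsa(X, k):
    """Truncated SVD of a term-by-document matrix X (rows: terms, columns: documents)."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    U_k, s_k, Vt_k = U[:, :k], s[:k], Vt[:k, :]
    term_vecs = U_k * s_k   # one k-dimensional row per term
    doc_vecs = Vt_k.T * s_k  # one k-dimensional row per document
    return U_k, s_k, term_vecs, doc_vecs


def fold_in(doc_column, U_k):
    """Project a new document's term-weight vector into the k-dimensional space."""
    return U_k.T @ doc_column


# Toy 4-term x 3-document matrix (values are illustrative TF-IDF weights).
X = np.array([[1.0, 0.0, 2.0],
              [0.0, 3.0, 1.0],
              [4.0, 1.0, 0.0],
              [2.0, 2.0, 2.0]])
U_k, s_k, term_vecs, doc_vecs = lsa(X, k=2)
print(doc_vecs.shape)  # one 2-D vector per document
```

With this scaling, folding in a column that is already part of `X` reproduces its row in `doc_vecs`, so new and existing documents share the same coordinates.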

LSA Spaces
- Documents space
- Concepts space
I-TUTOR Conceptual and Activity spaces

Self-Organizing Maps
- A type of artificial neural network that is trained through unsupervised learning to produce a map
- The map is a low-dimensional (typically 2D) representation of the input space
- Two operating modes
  - Training: builds the map using input examples
  - Mapping: automatically classifies a new input vector
- Vectors from the semantic space are placed into the map by finding the node with the closest weight vector (in the Euclidean sense).
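The two operating modes can be sketched in NumPy as below. This is a generic textbook SOM, not the I-TUTOR code: the grid size, the linearly decaying learning rate and neighbourhood radius, and the function names `train_som` and `best_matching_unit` are all illustrative assumptions.

```python
import numpy as np


def best_matching_unit(weights, x):
    """Mapping mode: grid index of the node whose weight vector is closest to x."""
    dists = np.linalg.norm(weights - x, axis=-1)  # Euclidean distance per node
    return np.unravel_index(np.argmin(dists), dists.shape)


def train_som(data, rows, cols, epochs=20, lr0=0.5, seed=0):
    """Training mode: build a rows x cols map from the input vectors in data."""
    rng = np.random.default_rng(seed)
    weights = rng.random((rows, cols, data.shape[1]))
    # Grid coordinates of every node, used by the neighbourhood function.
    grid = np.stack(
        np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij"), axis=-1
    )
    sigma0 = max(rows, cols) / 2
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in data[rng.permutation(len(data))]:
            frac = step / n_steps
            lr = lr0 * (1 - frac)                  # decaying learning rate
            sigma = sigma0 * (1 - frac) + 1e-3     # shrinking neighbourhood radius
            bmu = best_matching_unit(weights, x)
            grid_dist2 = ((grid - np.array(bmu)) ** 2).sum(axis=-1)
            h = np.exp(-grid_dist2 / (2 * sigma ** 2))
            # Pull the winner and its grid neighbours towards the input.
            weights += lr * h[..., None] * (x - weights)
            step += 1
    return weights


data = np.array([[0.0, 0.1], [0.1, 0.0], [0.9, 1.0], [1.0, 0.9]])
som = train_som(data, rows=3, cols=3)
print(best_matching_unit(som, np.array([0.95, 0.95])))
```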

Clustering
- K-means clustering
- Parametric clustering by changing keyword weights
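A minimal sketch of plain K-means (Lloyd's algorithm) on 2D points such as map coordinates. It does not cover the parametric variant driven by keyword weights, whose details the slides do not give; the function name `k_means`, its parameters, and the toy points are illustrative.

```python
import numpy as np


def k_means(points, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm: returns (labels, centroids)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct input points chosen at random.
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Assign every point to its nearest centroid.
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=-1)
        labels = dists.argmin(axis=1)
        # Recompute each centroid; keep the old one if its cluster is empty.
        new_centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids


points = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
labels, centroids = k_means(points, k=2)
print(labels)
```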

I-TUTOR Process Pipe - Generating maps
- Multilingualism problem
- Graphic communication
- Visual code

Interface - Goals and Solutions
- Goal: looking at contents and topics together in one shot
  - Solution: concept map
- Goal: easy to understand, easy to use, expressive
  - Solution: choice of suitable metaphors in the GUI, Zooming User Interface, visual code

Interface - Concept Map and Metaphor
- Topics as concepts of the domain
- Topics and documents as points in a map
- Starry sky as a metaphor for enabling quick access to contents
- Topology and metrics as metaphors to depict the Conceptual Space

Interface - GUI
- Zooming User Interface
  - Recursive nesting
  - Arbitrary level of zoom
- Easy to interact
  - Reduced number of actions
    - Click
    - Drag
  - Familiarity (Google Maps, ...)

Interface - Visual Code
Graphical elements
- Colour code (distinct colours for distinct regions)
- Brightness
- Shapes
- Size
- Spatial closeness
- Markers to locate studied documents
Meanings
- Cluster of documents sharing a common topic
- Number of documents in a cluster
- Number of studied documents
- Spread of a topic in the course
- Semantic similarity

Evaluation
- First piloting round for enabling deep technical upgrades
- Second piloting round for an intensive evaluation of the maps
- More than 100 students involved in the courses owned by the partners
- First results are encouraging
- More than 60% of the people interviewed appreciated I-TUTOR as a whole

Future work - NLP
- NLP techniques for processing corpora
  - Topic categorization (ontology learning)
  - Symbolic approach
  - Semantic annotation
- NLP techniques for social activities
  - Pattern definition and matching
  - Co-reference resolution
  - Anaphors

Future work - Corpora clustering
- Sub-symbolic (hierarchical clustering, multiclustering)
- Symbolic (faceted classification)

Future work - Visualization
- New metaphors
- 3D visualization
- New facets to describe a student
  - Social interactions (nets, information flows, roles)
  - Complex behaviours described as combinations of different facets
  - The task at hand