Prototypical Implementation and Assessment of Relatedness Search in Laws, Judgments and Commentaries

Similar documents
BUILD-IT: Intuitive plant layout mediated by natural interaction

*** * * * COUNCIL * * CONSEIL OFEUROPE * * * DE L'EUROPE. Proceedings of the 9th Symposium on Legal Data Processing in Europe

AQUA: An Ontology-Driven Question Answering System

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

30 Jahre Kooperation zwischen TU Darmstadt & Tongji University Shanghai

Executive Programmes 2013

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Julie Gawrylowicz. Personal Statement and Research Interests

Matching Similarity for Keyword-Based Clustering

Assignment 1: Predicting Amazon Review Ratings

Resume Book Fall 2012 (PTMSc1) Part-time Master Program in Management (M.Sc.)

Including the Microsoft Solution Framework as an agile method into the V-Modell XT

How to Apply for Fellowships & Internships Connecting students to global careers!

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Xinyu Tang. Education. Research Interests. Honors and Awards. Professional Experience

Learning Methods for Fuzzy Systems

How to read a Paper ISMLL. Dr. Josif Grabocka, Carlotta Schatten

Tarrant County Sheriff's Office 2016 Training Calendar

Natural Language Processing. George Konidaris

THE M.A. DEGREE Revised 1994 Includes All Further Revisions Through May 2012

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

ANNUAL CURRICULUM REVIEW PROCESS for the 2016/2017 Academic Year

Chinese Politics and Diplomacy Program

Corporate Communication

Coding II: Server side web development, databases and analytics ACAD 276 (4 Units)

Development of an IT Curriculum. Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008

Customised Software Tools for Quality Measurement Application of Open Source Software in Education

Social Media Journalism J336F Unique ID CMA Fall 2012

JN2000: Introduction to Journalism Syllabus Fall 2016 Tuesdays and Thursdays 12:30 1:45 p.m., Arrupe Hall 222

Notenmeldung Abschlussarbeit an der TUM School of Management

WORKSHOP. technologies

Challenges for Higher Education in Europe: Socio-economic and Political Transformations

Collaboration Tier 1

A Case Study: News Classification Based on Term Frequency

Dr Diana Njeri Kimani (Ph.D) P.O. Box Nairobi, Kenya Tel:

11:00 am Robotics and the Law: An American Perspective Prof. Ryan Calo, University of Washington School of Law

FRESNO COUNTY INTELLIGENT TRANSPORTATION SYSTEMS (ITS) PLAN UPDATE

SITUATING AN ENVIRONMENT TO PROMOTE DESIGN CREATIVITY BY EXPANDING STRUCTURE HOLES

Communication and Cybernetics 17

Linking Task: Identifying authors and book titles in verbose queries

Study on the implementation and development of an ECVET system for apprenticeship

Information Session on Overseas Internships Career Center, SAO, HKUST 1 Dec 2016

Great Teachers, Great Leaders: Developing a New Teaching Framework for CCSD. Updated January 9, 2013

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Lecture 1: Basic Concepts of Machine Learning

Exposé for a Master s Thesis

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Dates and Prices 2016

Shared Leadership in Schools On-line, Fall 2008 Michigan State University

Dr. Judith Christina Abdel-Massih-Thiemann. Freelance consultant for organizational and project development

arxiv: v1 [cs.cl] 2 Apr 2017

Problems of the Arabic OCR: New Attitudes

Bachelor of Software Engineering: Emerging sustainable partnership with industry in ODL

INTERNATIONAL STUDENT TIMETABLE BRISBANE CAMPUS

Humboldt-Universität zu Berlin

VSAC Financial Aid Night is scheduled for Thursday, October 6 from 6:30 PM 7:30 PM here at CVU. Senior and junior families are encouraged to attend.

Sec123. Volleyball. 52 Resident Registration begins Aug. 5 Non-resident Registration begins Aug. 14

An Approach for Creating Sentence Patterns for Quality Requirements

BUS Computer Concepts and Applications for Business Fall 2012

IT4BI, Semester 2, UFRT. Welcome address, February 1 st, 2013 Arnaud Giacometti / Patrick Marcel

University of Texas Libraries. Welcome!

1. Introduction. 2. The OMBI database editor

Postprint.

A Comparison of Two Text Representations for Sentiment Analysis

Data Fusion Models in WSNs: Comparison and Analysis

Rule Learning With Negation: Issues Regarding Effectiveness

Vocabulary Usage and Intelligibility in Learner Language

The Ups and Downs of Preposition Error Detection in ESL Writing

Probabilistic Latent Semantic Analysis

CURRICULUM VITAE OF MARIE-LOUISE VIERØ

CollaboFramework. Framework and Methodologies for Collaborative Research in Digital Humanities. DHN Workshop. Organizers:

MS-431 The Cold War Aerospace Technology Oral History Project. Creator: Wright State University. Department of Archives and Special Collections

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

PROJECT PERIODIC REPORT

WE ARE EXCITED TO HAVE ALL OF OUR FFG KIDS BACK FOR OUR SCHOOL YEAR PROGRAM! WE APPRECIATE YOUR CONTINUED SUPPORT AS WE HEAD INTO OUR 8 TH SEASON!

Answers To The Energy Bus Discussion Guide

Master of Statistics - Master Thesis

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Welcome to the University of Hertfordshire and the MSc Environmental Management programme, which includes the following pathways:

A new Dataset of Telephone-Based Human-Human Call-Center Interaction with Emotional Evaluation

MSc MANAGEMENT COMPLEMENT YOUR CAREER - DEVELOP YOUR PROFESSIONAL SKILLS IN AN INTERNATIONAL ENVIRONMENT

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

Study in Berlin at the HTW. Study in Berlin at the HTW

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Language Independent Passage Retrieval for Question Answering

MSc MANAGEMENT COMPLEMENT YOUR CAREER - DEVELOP YOUR PROFESSIONAL SKILLS IN AN INTERNATIONAL ENVIRONMENT

CIS 2 Computers and the Internet in Society -

They did a superb job and they did it quick. I was amazed at how fast they did everything that they had to do.

Math 181, Calculus I

Sample Iep Goals For Anxiety


Juniors Spring Presentation

Design and Creation of Games GAME

Speech Emotion Recognition Using Support Vector Machine

Transcription:

Prototypical Implementation and Assessment of Relatedness Search in Laws, Judgments and Commentaries Master s Thesis Kickoff Presentation Philipp Pickel, 27.06.2016 Software Engineering for Business Information Systems (sebis) Department of Informatics Technische Universität München, Germany wwwmatthes.in.tum.de

Overview 1. Motivation 2. Dataset 3. Problem Statement 4. Literature Review 5. Solution Approach 6. Roadmap 160627 Pickel: Master's Thesis - Kickoff Presentation 2

Motivation Huge amounts of legal documents are available in digital format BGH publishes about 300 cases each month Database of juris contains > 4 million documents Beck-online provides > 2.5 million documents Precedents are important in lawyers' everyday work Case law in Anglo-American region Continental Europe High Courts give interpretations of laws Past cases can be used as guidance Transformation of precedents to common law All similar cases can t be found without computer aid 160627 Pickel: Master's Thesis - Kickoff Presentation 3

Motivation Legal data as source of latent structured data Similar structure for certain document types, e.g. judgments, laws Grammatical or spelling mistakes are uncommon Explicit and implicit relations between documents NLP and ML are promising approaches 160627 Pickel: Master's Thesis - Kickoff Presentation 4

Motivation Google Image Search Input: picture Output: similar pictures, information about the picture 160627 Pickel: Master's Thesis - Kickoff Presentation 5

Overview 1. Motivation 2. Dataset 3. Problem Statement 4. Literature Review 5. Solution Approach 6. Roadmap 160627 Pickel: Master's Thesis - Kickoff Presentation 6

Dataset BGH Judgments Extracted from beck-online From 1951 until today (most after 2000) AktG > 900 judgments 2100 words per judgment 40 explicit references per judgment Mietrecht > 700 judgments 2500 words per judgment 35 explicit references per judgment Laws AktG (last 30 years) BGB (latest version) 160627 Pickel: Master's Thesis - Kickoff Presentation 7

Overview 1. Motivation 2. Dataset 3. Problem Statement 4. Literature Review 5. Solution Approach 6. Roadmap 160627 Pickel: Master's Thesis - Kickoff Presentation 8

Problem Statement What determines relatedness in legal documents? How can a system recognize these relations? In which way can this knowledge be presented to a user? 160627 Pickel: Master's Thesis - Kickoff Presentation 9

Overview 1. Motivation 2. Dataset 3. Problem Statement 4. Literature Review 5. Solution Approach 6. Roadmap 160627 Pickel: Master's Thesis - Kickoff Presentation 10

Literature Review Functional Similarity and Relatedness of Texts Recommender Systems in the legal domain Technical Regular Expressions Pattern Matching Bag-of-words Word Vectors POS NER 160627 Pickel: Master's Thesis - Kickoff Presentation 11

Overview 1. Motivation 2. Dataset 3. Problem Statement 4. Literature Review 5. Solution Approach 6. Roadmap 160627 Pickel: Master's Thesis - Kickoff Presentation 12

Solution Approach Quantitative Evaluation Literature Review Evaluation Related Work Implementation Concept Prototypical implementation Derive concept for similarity search Final Evaluation with Expert Interviews 160627 Pickel: Master's Thesis - Kickoff Presentation 13

Overview 1. Motivation 2. Dataset 3. Problem Statement 4. Literature Review 5. Solution Approach 6. Roadmap 160627 Pickel: Master's Thesis - Kickoff Presentation 14

Roadmap Jun. Jul. Aug. Sept. Oct. Nov. Dec. R1 Related Work R2 R3 C1 Derive Concept C2 C3 I1 Implementation I2 I3 E1 Evaluation E2 E3 Final Write Master s Thesis Completed In Progress Not Started 160627 Pickel: Master's Thesis - Kickoff Presentation 15

Thank you for your attention! Any Questions? Philipp Pickel philipp.pickel@tum.de Technische Universität München Department of Informatics Chair of Software Engineering for Business Information Systems Boltzmannstraße 3 85748 Garching bei München Tel +49.89.289. Fax +49.89.289.17136 wwwmatthes.in.tum.de

References Francesconi, Enrico, et al. Semantic processing of legal texts: Where the language of law meets the law of language. Springer, 2010. juris GmbH. juris.de. [Online] [Zitat vom: 20. 06 2016.] http://www.juris.de. Schweighofer, Erich; Winiwarter, Werner; Merkl, Dieter. Information filtering: the computation of similarities in large corpora of legal texts. In: Proceedings of the 5th international conference on Artificial intelligence and law. ACM, 1995. S. 119-126. Wesel, Uwe und Beck, Hans Dieter. 250 Jahre rechtswissenschaftlicher Verlag C.H.Beck: 1763-2013. C.H.Beck, 2015. Winkels, Radboud, et al. Towards a Legal Recommender System. In: JURIX. 2014. S. 169-178. 160627 Pickel: Master's Thesis - Kickoff Presentation 17