CLiPS, University of Antwerp

Computational (Psycho)Linguistics

Technologies
python, javascript, machine learning, mongodb
Topics
text analytics, machine learning, computational linguistics, artificial intelligence, data mining
Computational (Psycho)Linguistics

The Computational Linguistics & Psycholinguistics Research Group of the University of Antwerp (CLiPS, http://www.clips.uantwerpen.be) focuses on applications of statistical and machine learning methods, trained on corpus data, to explain human language acquisition and processing data, and to develop automatic text analysis systems that are accurate, efficient, and robust enough to be used in practical applications.

There are 3 subgroups to CLiPS: (1) the sociolinguistics group studies language variation in different demographic groups. The (2) psycholinguists research the effect of cochlear implantation on child language acquisition. This description focuses on the (3) computational linguistics subgroup.

Current research at CLiPS' Computational Linguistics Group focuses on developing tools that can extract data from social media messages, such as fine-grained sentiment analysis, and the detection of subversive behavior on social network sites (sexually transgressive behavior, hate speech, ...). Furthermore, CLiPS is well known for its work on computational stylometry and has developed state-of-the-art technology for authorship attribution, as well as author profiling, i.e. the detection of personality, age and gender of the author of a text, based on personal writing style. Another line of research at CLiPS focuses on computational psycholinguistics and researches psychologically plausible models of child language acquisition and bilinguality. CLiPS also researches and develops tools for biomedical text mining.

Over the years, CLiPS has established a strong reputation in the application of machine learning methods on a variety of language technology problems for a wide range of languages. To capitalize on this reputation, a spin-off company, Textgain (textgain.com), was founded in 2015 that aims to bring CLiPS technology to the market for commercial purposes.

2017 Program

Successful Projects

Contributor
Masha Ivenskaya
Mentor
Guy De Pauw, Vincent Merckx, Tom De Smedt, Walter Daelemans
Organization
CLiPS, University of Antwerp
Fake News Detection
I am a graduate student in Computational Linguistics with a keen interest in fake news detection. I propose to build a classifier for detecting...
Contributor
Markus Beuckelmann
Mentor
Guy De Pauw, Tom De Smedt, Walter Daelemans
Organization
CLiPS, University of Antwerp
Modernizing the Pattern library
The purpose of this GSoC project will be to modernize Pattern, a Python library for machine learning, natural language processing (NLP) and web...