AQAvit - Deep AQAtik
- Mentors
- Shelley Lambert
- Organization
- JBoss Community
- Technologies
- python, bash, seaborn, sklearn, gensim
- Topics
- machine learning, nlp, doc2vec, LDA
The current proposal deals with the task of document similarity utilizing dense text representations. Furthermore, it leverages topic modelling to investigate potential non-informative sets of words which could be removed in the preprocessing phase. The latter step will improve training time and hopefully will increase performance on the document similarity task. The task occurred from the need of matching Github issues with the relevant Jenkins logs in order to correlate ci pipeline errors with prior identified issues.