Text Authorship Analysis ML Frameworks

Tools and models for analyzing written text to identify authorship, detect stylistic patterns, model topics, and classify writing characteristics. Includes LDA, topic modeling, stylometric analysis, and authorship attribution. Does NOT include general NLP, text classification for non-authorship tasks, or content moderation.

23 text authorship analysis frameworks are tracked. Three score above 50, which places them in the Established tier. The highest-rated is bigartm/bigartm at 68/100 with 672 stars.

Get all 23 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=text-authorship-analysis&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
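The same request can be issued from a script. The following is a minimal Python sketch using only the standard library. It assumes the endpoint returns a JSON body; the response schema is not described on this page, so the result is returned as decoded JSON without further interpretation. The helper names `build_url` and `fetch_projects` are hypothetical. Note that the curl example passes `limit=20` while 23 projects are tracked; whether the API accepts a larger `limit` is not stated here, so it is left as a parameter.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint and query parameters copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the request URL; `limit` defaults to the value in the curl example."""
    params = {
        "domain": "ml-frameworks",
        "subcategory": "text-authorship-analysis",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=10):
    """Fetch and decode the response (assumption: the body is JSON)."""
    # Keyless access is limited to 100 requests/day, so avoid calling this in a loop.
    with urlopen(build_url(limit), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    data = fetch_projects()
    # The structure of `data` is not documented here; print it to inspect.
    print(json.dumps(data, indent=2))
```

Running the file directly performs one request and pretty-prints whatever the API returns.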

# | Framework | Description | Score | Tier
1 | bigartm/bigartm | Fast topic modeling platform | 68 | Established
2 | piskvorky/gensim | Topic Modelling for Humans | 57 | Established
3 | vi3k6i5/GuidedLDA | Semi-supervised guided topic model with custom guidedLDA | 52 | Established
4 | gregversteeg/corex_topic | Hierarchical unsupervised and semi-supervised topic models for sparse count... | 49 | Emerging
5 | microsoft/knowledge-extraction-recipes-forms | Knowledge Extraction For Forms Accelerators & Examples | 48 | Emerging
6 | centre-for-humanities-computing/tweetopic | Blazing fast topic modelling for short texts. | 40 | Emerging
7 | google-marketing-solutions/ml_toast | Cluster multilingual search terms captured from different time windows into... | 38 | Emerging
8 | dayyass/latent-semantic-analysis | Pipeline for training LSA models using Scikit-Learn. | 31 | Emerging
9 | A-safarji/NLP-topic-modeling-project | Topic Modeling on subreddit (NLP). In order to work on NLP topic modeling,... | 30 | Emerging
10 | Develop-Packt/Topic-Modeling-and-Theme-Extraction | In this module you will learn how to analyze topic modeling output from... | 30 | Emerging
11 | arjo129/LangCluster | A visualization for cognates in various languages and how they spread | 25 | Experimental
12 | Su1ph3r/seshat | Stylometric Authorship Attribution & Psychological Profiling Tool | 22 | Experimental
13 | GabrielePisciotta/NLP-Authorship-Verification-Case-Study | Natural Language Processing project covering the task of Authorship Verification | 21 | Experimental
14 | alejandrejames/project-thesis | A topic modelling toolkit that can collect, pre-process, generate topic... | 21 | Experimental
15 | nigosto/authorship-recognition | Analysis and comparison of different machine learning models for authorship... | 21 | Experimental
16 | camara94/Analyse_semantique_latente | This article reviews latent semantic analysis (LSA), a theory... | 19 | Experimental
17 | Ahmadhammam03/topic-modeling-lda-nmf | Comprehensive topic modeling with LDA and NMF algorithms for discovering... | 18 | Experimental
18 | Sajjad-Shahali/Text_Authorship_Detection | 6-class text authorship detection pipeline for human and LLM-generated text... | 14 | Experimental
19 | chrisPiemonte/argonaut | Argument Mining from online debates and hybrid-semantics reasoning | 13 | Experimental
20 | IAmGreyBunny/ICTCompetition | For Huawei ICT Competition Finals (2021-2022) - AI NLP with BERT and... | 12 | Experimental
21 | kthomas4031/Author-Detector | Detects the author based on linguistic signatures | 11 | Experimental
22 | HayatiYrtgl/topic_modeling_sklearn_LDA | Python code: Uses LDA for topic modeling on text data. | 11 | Experimental
23 | anildervis/codexa-code-authorship | Code authorship attribution | 11 | Experimental
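
The summary figures at the top of the page (23 projects, 3 above 50) can be checked against the listing. The sketch below is only an illustration: the rows are transcribed by hand from the listing, the tier labels are copied rather than computed (the page states only the above-50 cutoff for the Established tier), and the name `ROWS` is hypothetical.

```python
from collections import Counter

# (framework, score, tier) transcribed from the listing on this page.
ROWS = [
    ("bigartm/bigartm", 68, "Established"),
    ("piskvorky/gensim", 57, "Established"),
    ("vi3k6i5/GuidedLDA", 52, "Established"),
    ("gregversteeg/corex_topic", 49, "Emerging"),
    ("microsoft/knowledge-extraction-recipes-forms", 48, "Emerging"),
    ("centre-for-humanities-computing/tweetopic", 40, "Emerging"),
    ("google-marketing-solutions/ml_toast", 38, "Emerging"),
    ("dayyass/latent-semantic-analysis", 31, "Emerging"),
    ("A-safarji/NLP-topic-modeling-project", 30, "Emerging"),
    ("Develop-Packt/Topic-Modeling-and-Theme-Extraction", 30, "Emerging"),
    ("arjo129/LangCluster", 25, "Experimental"),
    ("Su1ph3r/seshat", 22, "Experimental"),
    ("GabrielePisciotta/NLP-Authorship-Verification-Case-Study", 21, "Experimental"),
    ("alejandrejames/project-thesis", 21, "Experimental"),
    ("nigosto/authorship-recognition", 21, "Experimental"),
    ("camara94/Analyse_semantique_latente", 19, "Experimental"),
    ("Ahmadhammam03/topic-modeling-lda-nmf", 18, "Experimental"),
    ("Sajjad-Shahali/Text_Authorship_Detection", 14, "Experimental"),
    ("chrisPiemonte/argonaut", 13, "Experimental"),
    ("IAmGreyBunny/ICTCompetition", 12, "Experimental"),
    ("kthomas4031/Author-Detector", 11, "Experimental"),
    ("HayatiYrtgl/topic_modeling_sklearn_LDA", 11, "Experimental"),
    ("anildervis/codexa-code-authorship", 11, "Experimental"),
]

# Count projects per tier label as listed.
tier_counts = Counter(tier for _, _, tier in ROWS)

# Projects scoring above 50, the only cutoff the page states explicitly.
above_50 = [name for name, score, _ in ROWS if score > 50]

if __name__ == "__main__":
    print(len(ROWS), dict(tier_counts), above_50)
```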