feat: add LibreOffice support in Dockerfile and simplify extractor.py for legacy doc/ppt conversion 80911d0 snickerdust commited on 3 days ago
Convert tumpang tindih sentence similarity to pure Jaccard Index, removing all TF-IDF and Cosine dependencies from overlap calculation 77b6e5a snickerdust commited on 6 days ago
Update main overlap_score calculation to use pure Jaccard Index of stopword-filtered keyword sets 8683a49 snickerdust commited on 6 days ago
Implement Jaccard index stopword and conjunction filtering in compute_overlap function 64e8761 snickerdust commited on 6 days ago
Restore weighted ranking selection in BERT backend while preserving raw cosine similarity scores for frontend UI display 7bf441c snickerdust commited on 6 days ago
Clean up project structure: delete model_lexical.py and remove all its references/fallbacks, move compute_overlap to preprocessor.py, and remove TEKS COCOK badge from sidebar, retaining only sentence snippet previews ea23cc3 snickerdust commited on 7 days ago
Fix summary score calculation (remove source_weight multiplication) and add real-time search filtering & text-matching in the sidebar & summary panels fff86ff snickerdust commited on 7 days ago
fix: upgrade torch to 2.5.1+cpu — fixes torch.distributed.tensor.device_mesh import error with latest transformers 5cf1115 snickerdust commited on 7 days ago
fix: simplify topic name to 'Topik N', add bert_error field to API result for debugging c1a6fdb snickerdust commited on 7 days ago
fix: upgrade torch to 2.4.0+cpu (required by sentence-transformers>=3) and pin numpy<2 for binary compat bf43e1b snickerdust commited on 7 days ago
perf: use torch CPU-only wheel (~200MB vs 1.5GB) to speed up HF Spaces rebuild 0d31a72 snickerdust commited on 7 days ago
refactor: simplify BERTopic pipeline to exact LOTO config from notebook (no fallback scenarios) b364da5 snickerdust commited on 7 days ago
chore: align model print statement with BERT model backbone 300124a snickerdust commited on 7 days ago
fix: implement robust BERTopic fallback search to resolve empty topic results and 2.0 scores bdf1987 snickerdust commited on 7 days ago
Fix scipy linalg eigh crash on small datasets by implementing dynamic UMAP/HDBSCAN parameter scaling c0a353b snickerdust commited on 7 days ago
Update model to snickerdust/FT-PLN-IndoBERT and apply optimal LOTO parameters 66228d3 snickerdust commited on 7 days ago