feat: add LibreOffice support in Dockerfile and simplify extractor.py for legacy doc/ppt conversion 80911d0 snickerdust commited on about 1 hour ago
Ignore indobert_finetuned and venv folders in firebase.json to prevent large file uploads to Firebase Hosting 21856bb snickerdust commited on 2 days ago
Convert tumpang tindih sentence similarity to pure Jaccard Index, removing all TF-IDF and Cosine dependencies from overlap calculation 77b6e5a snickerdust commited on 3 days ago
Update main overlap_score calculation to use pure Jaccard Index of stopword-filtered keyword sets 8683a49 snickerdust commited on 3 days ago
Implement Jaccard index stopword and conjunction filtering in compute_overlap function 64e8761 snickerdust commited on 3 days ago
Restore weighted ranking selection in BERT backend while preserving raw cosine similarity scores for frontend UI display 7bf441c snickerdust commited on 3 days ago
Clean up project structure: delete model_lexical.py and remove all its references/fallbacks, move compute_overlap to preprocessor.py, and remove TEKS COCOK badge from sidebar, retaining only sentence snippet previews ea23cc3 snickerdust commited on 3 days ago
Update index.html to render search snippet previews containing matching original sentences under document items in sidebar 4a366c5 snickerdust commited on 3 days ago
Fix summary score calculation (remove source_weight multiplication) and add real-time search filtering & text-matching in the sidebar & summary panels fff86ff snickerdust commited on 3 days ago
feat: sort summary by score descending in frontend display c8dbbaa snickerdust commited on 4 days ago
fix: upgrade torch to 2.5.1+cpu — fixes torch.distributed.tensor.device_mesh import error with latest transformers 5cf1115 snickerdust commited on 4 days ago
fix: simplify topic name to 'Topik N', add bert_error field to API result for debugging c1a6fdb snickerdust commited on 4 days ago
fix: upgrade torch to 2.4.0+cpu (required by sentence-transformers>=3) and pin numpy<2 for binary compat bf43e1b snickerdust commited on 4 days ago
perf: use torch CPU-only wheel (~200MB vs 1.5GB) to speed up HF Spaces rebuild 0d31a72 snickerdust commited on 4 days ago
refactor: simplify BERTopic pipeline to exact LOTO config from notebook (no fallback scenarios) b364da5 snickerdust commited on 4 days ago
chore: align model print statement with BERT model backbone 300124a snickerdust commited on 4 days ago
fix: implement robust BERTopic fallback search to resolve empty topic results and 2.0 scores bdf1987 snickerdust commited on 4 days ago
Fix scipy linalg eigh crash on small datasets by implementing dynamic UMAP/HDBSCAN parameter scaling c0a353b snickerdust commited on 4 days ago
Remove model weights and duplicate clone from Git tracking, add to gitignore fd75153 snickerdust commited on 4 days ago
Update model to snickerdust/FT-PLN-IndoBERT and apply optimal LOTO parameters 66228d3 snickerdust commited on 4 days ago
Merge branch 'main' of https://huggingface.co/spaces/snickerdust/PLN-summary 0668112 snickerdust commited on 22 days ago