Oishi Deb's picture

1 2

Oishi Deb PRO

OishiDeb

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Measuring what Matters: Construct Validity in Large Language Model Benchmarks

authored a paper 3 months ago

Measuring what Matters: Construct Validity in Large Language Model Benchmarks

liked a dataset 7 months ago

unreasonablebenchmark/unreasonable-benchmark

View all activity

Organizations

upvoted a paper about 2 months ago

Measuring what Matters: Construct Validity in Large Language Model Benchmarks

Paper • 2511.04703 • Published Nov 3, 2025 • 8

authored a paper 3 months ago

Measuring what Matters: Construct Validity in Large Language Model Benchmarks

Paper • 2511.04703 • Published Nov 3, 2025 • 8

liked 2 datasets 7 months ago

unreasonablebenchmark/unreasonable-benchmark

Viewer • Updated Aug 8, 2025 • 128 • 27 • 2

ambean/construct-validity-review

Preview • Updated Nov 23, 2025 • 18 • 5