---
title: LLM Data Analyzer
emoji: πŸ“Š
colorFrom: blue
colorTo: indigo
sdk: docker
sdk_version: latest
app_file: app.py
pinned: false
---

# πŸ“Š LLM Data Analyzer

An AI-powered tool for analyzing tabular data and chatting with an assistant backed by Llama 2.

## Features

- **πŸ“€ Upload & Analyze**: Upload CSV or Excel files and get instant analysis
- **πŸ’¬ Chat**: Have conversations with Llama 2 AI assistant
- **πŸ“Š Data Statistics**: View comprehensive data summaries and insights
- **πŸš€ Fast**: Runs on free Hugging Face CPU tier

## How to Use

1. **Upload Data** - Start by uploading a CSV or Excel file
2. **Preview** - Review your data and statistics
3. **Ask Questions** - Get AI-powered analysis and insights
4. **Chat** - Have follow-up conversations with the AI
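Under the hood, the preview in step 2 boils down to a handful of pandas calls. A minimal sketch of that kind of summary (the helper name `summarize_dataframe` is illustrative, not part of the app's API):

```python
import pandas as pd

def summarize_dataframe(df: pd.DataFrame) -> dict:
    """Collect the basic statistics a data preview typically shows."""
    return {
        "rows": len(df),
        "columns": list(df.columns),
        "dtypes": {col: str(dtype) for col, dtype in df.dtypes.items()},
        "missing": df.isna().sum().to_dict(),
        "numeric_summary": df.describe().to_dict(),
    }

# Tiny example frame; in the app the frame comes from the uploaded
# CSV (pd.read_csv) or Excel file (pd.read_excel).
df = pd.DataFrame({"region": ["EU", "US"], "revenue": [120.0, 95.5]})
summary = summarize_dataframe(df)
print(summary["rows"])  # -> 2
```

A summary like this is also a compact way to give the model context about the data without pasting the whole file into the prompt.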

## Technology Stack

- **Model**: Llama 2 7B (quantized to 4-bit)
- **Framework**: Streamlit
- **Inference Engine**: Llama.cpp
- **Hosting**: Hugging Face Spaces
- **Language**: Python 3.10+
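Whichever inference engine runs the model, the Llama 2 chat variant expects its `[INST]`/`<<SYS>>` prompt template, so user questions have to be wrapped before generation. A minimal sketch (the function name and default system prompt are illustrative):

```python
def build_llama2_prompt(
    user_message: str,
    system_prompt: str = "You are a helpful data analyst.",
) -> str:
    """Wrap a user message in the Llama 2 chat prompt template."""
    return (
        "<s>[INST] <<SYS>>\n"
        f"{system_prompt}\n"
        "<</SYS>>\n\n"
        f"{user_message} [/INST]"
    )

prompt = build_llama2_prompt("Which column has the most missing values?")
```

The string returned here is what gets handed to the inference engine; the model's reply is everything generated after the closing `[/INST]`.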

## Performance

| Metric | Value |
|--------|-------|
| Speed | ~5-10 tokens/second (free CPU) |
| Model Size | 4GB (quantized) |
| Context Window | 2048 tokens |
| First Load | ~30 seconds (model download) |
| Subsequent Responses | ~5-15 seconds |
| Hardware | Free Hugging Face CPU |
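The response-time row follows directly from the throughput row: at 5–10 tokens/second, a typical answer of 50–100 tokens lands in the quoted 5–15 second range. As arithmetic (helper name illustrative):

```python
def response_time_range(
    n_tokens: int, tps_low: float = 5.0, tps_high: float = 10.0
) -> tuple:
    """Best- and worst-case wall time (seconds) to generate n_tokens."""
    return (n_tokens / tps_high, n_tokens / tps_low)

response_time_range(75)  # (7.5, 15.0) seconds for a 75-token answer
```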

## Local Development (Faster)

For faster local development with GPU acceleration on an Apple Silicon Mac:

```bash
# Clone the repository
git clone https://github.com/Arif-Badhon/LLM-Data-Analyzer
cd LLM-Data-Analyzer

# Switch to huggingface-deployment branch
git checkout huggingface-deployment

# Install dependencies
pip install -r requirements.txt

# Run with MLX (Apple Silicon GPU - ~70 tokens/second)
streamlit run app.py
```
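The speed gap between the two engines comes down to which backend is picked at startup: MLX on Apple Silicon, llama.cpp elsewhere. One way such a check might look (a sketch of the idea, not the repository's actual code):

```python
import platform
import sys

def pick_backend() -> str:
    """Prefer MLX on Apple Silicon macOS; fall back to llama.cpp elsewhere."""
    if sys.platform == "darwin" and platform.machine() == "arm64":
        return "mlx"        # GPU-accelerated, ~70 tokens/second
    return "llama.cpp"      # CPU inference, ~5-10 tokens/second
```

Detecting the platform up front keeps one code path for chat and analysis while letting each machine use its fastest available engine.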

## Deployment Options

### Option 1: Hugging Face Space (Free)
- CPU-based inference
- Speed: 5-10 tokens/second
- Cost: Free

### Option 2: Local with MLX (Fastest)
- GPU-accelerated on Apple Silicon
- Speed: 70+ tokens/second
- Cost: Free (uses your Mac)

### Option 3: Hugging Face PRO (Fast)
- GPU-accelerated inference
- Speed: 50+ tokens/second
- Cost: $9/month

## Getting Started

### Quick Start (3 minutes)

```bash
# 1. Install Python 3.10+
# 2. Clone repo
git clone https://github.com/Arif-Badhon/LLM-Data-Analyzer
cd LLM-Data-Analyzer

# 3. Install dependencies
pip install -r requirements.txt

# 4. Run Streamlit app
streamlit run app.py
```

### With Docker (Local Development)

```bash
# Make sure Docker Desktop is running
docker-compose up --build

# Access at http://localhost:8501
```

## Troubleshooting

### "Model download failed"
- Check internet connection
- Hugging Face Spaces need internet access to download models from the Hub
- Wait and refresh the page

### "App takes too long to load"
- Normal on first request (10-30 seconds)
- Model is being downloaded and cached
- Subsequent requests are much faster

### "Out of memory"
- Free tier CPU is limited
- Unlikely with quantized 4GB model
- If it happens, upgrade to HF PRO

### "Slow responses"
- Free tier CPU is slower than GPU
- Expected: 5-10 tokens/second
- For faster responses: use local MLX (70 t/s) or upgrade HF tier

## Technologies Used

- **Python** - Core language
- **Streamlit** - Web UI framework
- **Llama 2** - Large language model
- **Llama.cpp** - CPU inference
- **MLX** - Apple Silicon GPU inference
- **Pandas** - Data processing
- **Docker** - Containerization
- **Hugging Face Hub** - Model hosting

## License

MIT License

## Author

**Arif Badhon**

## Support

If you encounter any issues:
1. Check the Troubleshooting section above
2. Review the Hugging Face Spaces documentation
3. Open an issue on GitHub

---

**Happy analyzing! πŸš€**