FLUJOS/.gitignore
CAPITANSITO a40b946163 Initial commit - FLUJOS codebase (production branch)
Includes: FLUJOS app (Node/Flask/Python), FLUJOS_DATOS scripts (scrapers, Keras, Django)
Excludes: MongoDB, scraped data, Wikipedia/WikiLeaks dumps, Python venv, node_modules
2026-03-31 14:10:02 +02:00

84 lines
1.3 KiB
Text

# ========================
# DATOS PESADOS - NO SUBIR
# ========================
# MongoDB (3.2 GB)
FLUJOS_DATOS/MONGO/
# Noticias escrapeadas (1.9 GB)
FLUJOS_DATOS/NOTICIAS/archivos/
FLUJOS_DATOS/NOTICIAS/articulos/
FLUJOS_DATOS/NOTICIAS/tokenized/
# Wikipedia (611 MB)
FLUJOS_DATOS/WIKIPEDIA/articulos_wikipedia/
FLUJOS_DATOS/WIKIPEDIA/articulos_tokenizados/
# Torrents / WikiLeaks (1.1 GB)
FLUJOS_DATOS/TORRENTS/TORRENTS_WIKILEAKS_COMPLETO/tokenized/
FLUJOS_DATOS/TORRENTS/TORRENTS_WIKILEAKS_COMPLETO/txt/
# Entorno virtual Python (2.1 GB)
FLUJOS_DATOS/myenv/
myenv/
venv/
env/
.venv/
# NLTK data (50 MB)
nltk_data/
# Bases de datos
*.sqlite3
*.db
# ========================
# DEPENDENCIAS NODE
# ========================
node_modules/
**/node_modules/
# ========================
# SECRETOS Y CONFIG LOCAL
# ========================
.env
.env.*
!.env.example
# ========================
# PYTHON
# ========================
__pycache__/
*.py[cod]
*.pyo
*.pyd
*.egg-info/
# ========================
# TEMPORALES Y BACKUPS
# ========================
*.save
*.bak
*_COPIA*
*~
.DS_Store
Thumbs.db
# ========================
# LOGS
# ========================
logs/
*.log
npm-debug.log*
# ========================
# IDEs
# ========================
.vscode/
.idea/
*.swp
*.swo
# Parcel cache
.cache/
.parcel-cache/