Includes: FLUJOS app (Node/Flask/Python), FLUJOS_DATOS scripts (scrapers, Keras, Django) Excludes: MongoDB, scraped data, Wikipedia/WikiLeaks dumps, Python venv, node_modules
84 lines
1.3 KiB
Text
84 lines
1.3 KiB
Text
# ========================
|
|
# DATOS PESADOS - NO SUBIR
|
|
# ========================
|
|
|
|
# MongoDB (3.2 GB)
|
|
FLUJOS_DATOS/MONGO/
|
|
|
|
# Noticias escrapeadas (1.9 GB)
|
|
FLUJOS_DATOS/NOTICIAS/archivos/
|
|
FLUJOS_DATOS/NOTICIAS/articulos/
|
|
FLUJOS_DATOS/NOTICIAS/tokenized/
|
|
|
|
# Wikipedia (611 MB)
|
|
FLUJOS_DATOS/WIKIPEDIA/articulos_wikipedia/
|
|
FLUJOS_DATOS/WIKIPEDIA/articulos_tokenizados/
|
|
|
|
# Torrents / WikiLeaks (1.1 GB)
|
|
FLUJOS_DATOS/TORRENTS/TORRENTS_WIKILEAKS_COMPLETO/tokenized/
|
|
FLUJOS_DATOS/TORRENTS/TORRENTS_WIKILEAKS_COMPLETO/txt/
|
|
|
|
# Entorno virtual Python (2.1 GB)
|
|
FLUJOS_DATOS/myenv/
|
|
myenv/
|
|
venv/
|
|
env/
|
|
.venv/
|
|
|
|
# NLTK data (50 MB)
|
|
nltk_data/
|
|
|
|
# Bases de datos
|
|
*.sqlite3
|
|
*.db
|
|
|
|
# ========================
|
|
# DEPENDENCIAS NODE
|
|
# ========================
|
|
node_modules/
|
|
**/node_modules/
|
|
|
|
# ========================
|
|
# SECRETOS Y CONFIG LOCAL
|
|
# ========================
|
|
.env
|
|
.env.*
|
|
!.env.example
|
|
|
|
# ========================
|
|
# PYTHON
|
|
# ========================
|
|
__pycache__/
|
|
*.py[cod]
|
|
*.pyo
|
|
*.pyd
|
|
*.egg-info/
|
|
|
|
# ========================
|
|
# TEMPORALES Y BACKUPS
|
|
# ========================
|
|
*.save
|
|
*.bak
|
|
*_COPIA*
|
|
*~
|
|
.DS_Store
|
|
Thumbs.db
|
|
|
|
# ========================
|
|
# LOGS
|
|
# ========================
|
|
logs/
|
|
*.log
|
|
npm-debug.log*
|
|
|
|
# ========================
|
|
# IDEs
|
|
# ========================
|
|
.vscode/
|
|
.idea/
|
|
*.swp
|
|
*.swo
|
|
|
|
# Parcel cache
|
|
.cache/
|
|
.parcel-cache/
|