Antes la raiz tenia 20+ archivos sueltos. Ahora organizado en:
docs/ 10 archivos .md de documentacion tecnica
scripts/ 3 scripts utilitarios (credentials, migrate, verify)
config/ entity_config.json (aliases y blacklist NER)
data/ feeds.csv (feeds precargados)
Eliminados restos de Docker que ya no aplican:
.dockerignore, .env.example, .env.secure.example, nginx.conf (raiz)
Makefile: eliminados targets docker-build, añadidos install/rebuild/check/poc
Referencias actualizadas en:
deploy/debian/install.sh entity_config.json -> config/entity_config.json
deploy/debian/build.sh entity_config.json -> config/entity_config.json
README.md links a docs/ y data/ actualizados,
arbol de estructura del repo reescrito
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2.1 KiB
2.1 KiB
Deployment Guide
This guide describes how to deploy the application to a new server.
Prerequisites
- Linux Server (Ubuntu 22.04+ recommended)
- NVIDIA GPU: Required for translation, embeddings, and NER services.
- NVIDIA Container Toolkit: Must be installed to allow Docker to access the GPU.
- Docker & Docker Compose: Latest versions.
- Git: To clone the repository.
- External Service: An instance of AllTalk running externally or on the host (port 7851 by default).
Deployment Steps
-
Clone the Repository
git clone <your-repo-url> cd <your-repo-name> -
Configure Environment Variables Copy the example configuration file:
cp .env.example .envEdit
.envand set secure passwords and configuration:nano .env- Change
POSTGRES_PASSWORDandDB_PASSto a strong unique password. - Change
SECRET_KEYto a long random string. - Verify
ALLTALK_URLpoints to your AllTalk instance (default assumes host machine access).
- Change
-
Start the Services Run the following command to build and start the application:
docker compose up -d --build -
Database Initialization The database will automatically initialize on the first run using the scripts in
init-db/. This may take a few minutes. Check logs with:docker compose logs -f db -
Verify Deployment Access the application at
http://<your-server-ip>:8001.
Important Notes
- Models: The application mounts
./modelsand./hf_cacheto persist AI models. On the first run, it will attempt to download necessary models (NLLB, BERT, etc.), which requires significant bandwidth and time. - Data Persistence: Database data is stored in
./pgdata(mapped in docker-compose). Ensure this directory is backed up. - Security: Ensure port 5432 (Postgres) and 6379 (Redis) are firewall-protected and not exposed to the public internet unless intended (Docker maps them to the host network).