Initial commit - FLUJOS codebase (production branch)
Includes: FLUJOS app (Node/Flask/Python), FLUJOS_DATOS scripts (scrapers, Keras, Django) Excludes: MongoDB, scraped data, Wikipedia/WikiLeaks dumps, Python venv, node_modules
This commit is contained in:
commit
a40b946163
158 changed files with 196645 additions and 0 deletions
196
FLUJOS_DATOS/NOTICIAS/processed_articles.txt
Executable file
196
FLUJOS_DATOS/NOTICIAS/processed_articles.txt
Executable file
|
|
@ -0,0 +1,196 @@
|
|||
https://reactionary.international/database/
|
||||
https://aleph.occrp.org/
|
||||
https://offshoreleaks.icij.org/
|
||||
https://www.publico.es/
|
||||
https://www.elsaltodiario.com/
|
||||
https://www.nytimes.com/
|
||||
https://www.theguardian.com/
|
||||
https://www.lemonde.fr/
|
||||
https://www.spiegel.de/
|
||||
https://elpais.com/
|
||||
https://www.repubblica.it/
|
||||
https://www.scmp.com/
|
||||
https://www.smh.com.au/
|
||||
https://www.globo.com/
|
||||
https://timesofindia.indiatimes.com/
|
||||
https://www.asahi.com/
|
||||
https://www.washingtonpost.com/
|
||||
https://www.aljazeera.com/
|
||||
https://www.folha.uol.com.br/
|
||||
https://www.telegraph.co.uk/
|
||||
https://www.corriere.it/
|
||||
https://www.clarin.com/
|
||||
https://www.eluniversal.com.mx/
|
||||
https://www.welt.de/
|
||||
https://www.lanacion.com.ar/
|
||||
https://www.bbc.com/
|
||||
https://www.elconfidencial.com/
|
||||
https://www.expansion.com/
|
||||
https://www.lavanguardia.com/
|
||||
https://www.elperiodico.com/
|
||||
https://www.abc.es/
|
||||
https://www.elespanol.com/
|
||||
https://www.lainformacion.com/
|
||||
https://www.elcorreo.com/
|
||||
https://www.canarias7.es/
|
||||
https://www.diariovasco.com/
|
||||
https://www.farodevigo.es/
|
||||
https://www.lavozdegalicia.es/
|
||||
https://www.marca.com/
|
||||
https://www.mundodeportivo.com/
|
||||
https://www.elmundo.es/
|
||||
https://www.wired.com/
|
||||
https://www.techcrunch.com/
|
||||
https://www.cybersecurity-insiders.com/
|
||||
https://www.darkreading.com/
|
||||
https://www.hackread.com/
|
||||
https://www.theregister.com/
|
||||
https://www.csoonline.com/
|
||||
https://www.scmagazine.com/
|
||||
https://www.securityweek.com/
|
||||
https://www.infosecurity-magazine.com/
|
||||
https://www.hackaday.com/
|
||||
https://www.economist.com/
|
||||
https://www.ft.com/
|
||||
https://www.bloomberg.com/
|
||||
https://www.wsj.com/
|
||||
https://www.forbes.com/
|
||||
https://www.businessinsider.com/
|
||||
https://www.reuters.com/
|
||||
https://www.cnbc.com/
|
||||
https://www.nbcnews.com/
|
||||
https://www.cbsnews.com/
|
||||
https://www.abcnews.go.com/
|
||||
https://www.vox.com/
|
||||
https://www.politico.com/
|
||||
https://www.euronews.com/
|
||||
https://www.france24.com/
|
||||
https://www.rt.com/
|
||||
https://www.al-monitor.com/
|
||||
https://www.jpost.com/
|
||||
https://www.haaretz.com/
|
||||
https://www.middleeasteye.net/
|
||||
https://www.indiatoday.in/
|
||||
https://www.chinadaily.com.cn/
|
||||
https://www.japantimes.co.jp/
|
||||
https://www.koreatimes.co.kr/
|
||||
https://www.thehindu.com/
|
||||
https://www.nikkei.com/
|
||||
https://www.manilatimes.net/
|
||||
https://www.bangkokpost.com/
|
||||
https://www.theaustralian.com.au/
|
||||
https://www.nzherald.co.nz/
|
||||
https://www.theglobeandmail.com/
|
||||
https://www.torontostar.com/
|
||||
https://www.ctvnews.ca/
|
||||
https://www.globalnews.ca/
|
||||
https://www.thehill.com/
|
||||
https://www.breitbart.com/
|
||||
https://www.nationalreview.com/
|
||||
https://www.slate.com/
|
||||
https://www.newyorker.com/
|
||||
https://www.atlanticcouncil.org/
|
||||
https://www.chathamhouse.org/
|
||||
https://www.rand.org/
|
||||
https://www.cfr.org/
|
||||
https://www.brookings.edu/
|
||||
https://www.carnegieendowment.org/
|
||||
https://www.wilsoncenter.org/
|
||||
https://www.hoover.org/
|
||||
https://www.csis.org/
|
||||
https://www.heritage.org/
|
||||
https://www.aspi.org.au/
|
||||
https://www.iiss.org/
|
||||
https://www.rusi.org/
|
||||
https://www.intelligenceonline.com/
|
||||
https://www.sit.kb.gov.tr/
|
||||
https://www.securitymagazine.com/
|
||||
https://www.zdnet.com/
|
||||
https://www.helpnetsecurity.com/
|
||||
https://www.bankinfosecurity.com/
|
||||
https://www.nsa.gov/
|
||||
https://www.fbi.gov/
|
||||
https://www.mi5.gov.uk/
|
||||
https://www.mi6.gov.uk/
|
||||
https://www.mss.gov.cn/
|
||||
https://www.bnd.bund.de/
|
||||
https://www.cni.es/
|
||||
https://www.cis.es/
|
||||
https://www.dni.gov/
|
||||
https://www.mossad.gov.il/
|
||||
https://www.afp.gov.au/
|
||||
https://www.royalnavy.mod.uk/
|
||||
https://www.gov.uk/government/organisations/foreign-commonwealth-office
|
||||
https://www.cabinetoffice.gov.uk/
|
||||
https://www.janes.com/
|
||||
https://www.gov.uk/government/organisations/defence-intelligence
|
||||
https://www.nato.int/
|
||||
https://www.un.org/en/
|
||||
https://www.worldbank.org/
|
||||
https://www.imf.org/
|
||||
https://www.weforum.org/
|
||||
https://www.oecd.org/
|
||||
https://www.wto.org/
|
||||
https://www.unesco.org/
|
||||
https://www.who.int/
|
||||
https://www.icc-cpi.int/
|
||||
https://www.eurojust.europa.eu/
|
||||
https://www.europol.europa.eu/
|
||||
https://www.dia.mil/
|
||||
https://www.nro.gov/
|
||||
https://www.cia.gov/
|
||||
https://www.sis.gov.uk/
|
||||
https://www.interpol.int/
|
||||
https://www.intel.gov/
|
||||
https://www.financialtimes.com/
|
||||
https://www.wallstreetjournal.com/
|
||||
https://www.fortune.com/
|
||||
https://www.marketwatch.com/
|
||||
https://www.barrons.com/
|
||||
https://www.nasdaq.com/
|
||||
https://www.sec.gov/
|
||||
https://www.nyse.com/
|
||||
https://www.isda.org/
|
||||
https://www.technologyreview.com/
|
||||
https://www.cyberdefensemagazine.com/
|
||||
https://www.computerweekly.com/
|
||||
https://www.itpro.co.uk/
|
||||
https://www.datacenterdynamics.com/
|
||||
https://www.teiss.co.uk/
|
||||
https://www.tripwire.com/
|
||||
https://www.infoworld.com/
|
||||
https://www.cnet.com/
|
||||
https://www.tomsguide.com/
|
||||
https://www.theverge.com/
|
||||
https://www.arstechnica.com/
|
||||
https://www.engadget.com/
|
||||
https://www.gizmodo.com/
|
||||
https://www.vice.com/
|
||||
https://www.theatlantic.com/
|
||||
https://www.rollingstone.com/
|
||||
https://www.thedailybeast.com/
|
||||
https://www.salon.com/
|
||||
https://www.huffpost.com/
|
||||
https://www.bbc.co.uk/news
|
||||
https://www.dailymail.co.uk/home/index.html
|
||||
https://www.independent.co.uk/
|
||||
https://www.irishtimes.com/
|
||||
https://www.thejournal.ie/
|
||||
https://www.thetimes.co.uk/
|
||||
https://www.thesun.co.uk/
|
||||
https://www.dw.com/
|
||||
https://www.lefigaro.fr/
|
||||
https://www.derstandard.at/
|
||||
https://www.nzz.ch/
|
||||
https://www.eldiario.es/
|
||||
https://www.rtve.es/
|
||||
https://www.elciudadano.com/
|
||||
https://www.apnews.com/
|
||||
https://www.univision.com/
|
||||
https://www.televisa.com/
|
||||
https://www.cnn.com/
|
||||
https://www.foxnews.com/
|
||||
https://www.trtworld.com/
|
||||
https://www.newsweek.com/
|
||||
https://www.time.com/
|
||||
https://www.spectator.co.uk/
|
||||
Loading…
Add table
Add a link
Reference in a new issue