Commit Graph

  • 29881deca5 Updated: README.md main Carlos Sousa 2021-09-01 20:56:57 +02:00
  • d655091a36 Hotfix: Dependencies were missing in functions.py Carlos Sousa 2021-08-06 13:15:59 +02:00
  • 18e150e71a Fix #4 Carlos Sousa 2021-08-06 01:22:44 +02:00
  • feb97c7039 Added: Remove Duplicates Carlos Sousa 2021-08-06 01:18:57 +02:00
  • e7f144a438 Updated: Improved ErrorHandling Carlos Sousa 2021-07-29 20:36:03 +02:00
  • 5d1dcfcccb Updated: PoC database.sql Carlos Sousa 2021-07-29 20:35:45 +02:00
  • 69a8f85ae3 Updated: Better Error Handling Carlos Sousa 2021-07-29 20:13:48 +02:00
  • 6419ba97a7 Updated: MariaDB as Back Completed Carlos Sousa 2021-07-29 17:42:13 +02:00
  • 96e29fd448 Changed: Folder Structure Carlos Sousa 2021-07-28 19:52:15 +02:00
  • 74d623d39e Added [ToDo:] Carlos Sousa 2021-05-04 00:06:23 +02:00
  • f59931efd9 Added ToDo Carlos Sousa 2021-05-04 00:05:11 +02:00
  • 4c437ade07 Merge branch 'main' of https://github.com/zebrajr/imdbscrapper into main Carlos Sousa 2021-05-03 17:54:24 +02:00
  • 51ab8659fb Fixed Description - ";" replaced Carlos Sousa 2021-05-03 17:54:21 +02:00
  • aa6f8e0f8d Fixed Description - ";" replaced Carlos Sousa 2021-05-03 17:44:46 +02:00
  • 05dfb1e15b Added link to scrap data repo Carlos Sousa 2021-05-03 17:06:35 +02:00
  • ebd422d10c Changed limit on currentEndURL from 0 to endURL Carlos Sousa 2021-05-03 16:07:22 +02:00
  • 301de775a8 Added env variables for startURL, endURL, steUpCycle, nrProcesses to docker Carlos Sousa 2021-05-03 15:01:27 +02:00
  • 8d6a3aec8a Merge pull request #1 from zebrajr/devProcesses Carlos Sousa 2021-05-03 14:45:15 +02:00
  • ecb7da2939 Migrated from single to parallel processes Carlos Sousa 2021-05-03 14:43:08 +02:00
  • 4999912989 Trying I/O improvements Carlos Sousa 2021-05-03 02:49:25 +02:00
  • f5cb768a65 Started multithreading POC Carlos Sousa 2021-05-03 02:06:12 +02:00
  • b1984e1fdf 10000000 - 9956224 Carlos Sousa 2021-05-02 21:54:14 +02:00
  • 69c3632d05 Added ToDo legacy-SingleProcess Carlos Sousa 2021-05-02 18:38:56 +02:00
  • 32f2d04397 Added reCheck file logic for better performance on reChecks. Changed check to descending order. Carlos Sousa 2021-05-02 18:36:17 +02:00
  • 7613816d1a Added Total Rating Count Carlos Sousa 2021-05-02 16:25:02 +02:00
  • 641d5faf4c Added ToDo Carlos Sousa 2021-05-02 14:56:18 +02:00
  • 00b9cc19cf First 1640 entries indexed Carlos Sousa 2021-05-02 14:38:14 +02:00
  • 8511975591 Fixed grammar Carlos Sousa 2021-05-02 14:35:54 +02:00
  • e8034d23ac Updated Task. Updated Action. Carlos Sousa 2021-05-02 14:34:54 +02:00
  • bacdbcf75f Fixed info on installing requirements Carlos Sousa 2021-05-02 14:26:43 +02:00
  • 3bf7895f32 Removed invalid options Carlos Sousa 2021-05-02 14:25:40 +02:00
  • 894b5c7f0c README.md updated Carlos Sousa 2021-05-02 14:23:12 +02:00
  • d7548ba624 Added better error handling. Added continue from last value Carlos Sousa 2021-05-02 14:10:12 +02:00
  • af7fe142df Main Logic completed. Carlos Sousa 2021-05-02 13:44:34 +02:00
  • 7f47c10346 Main logic done. Will write movies to movies.csv and shows to series.csv Carlos Sousa 2021-05-01 23:52:37 +02:00
  • c224144cd5 Added running as non-root user Carlos Sousa 2021-05-01 23:52:01 +02:00
  • 54289759ea Added directory for storage Carlos Sousa 2021-05-01 23:51:38 +02:00
  • eaf30aa206 Added basic file layout, dockerfile and docker-compose.yml Carlos Sousa 2021-04-30 13:16:40 +02:00
  • cfe95825bc Initial commit Carlos Sousa 2021-04-30 13:11:32 +02:00