Installation with spaCy 3.5

How to install dependency libraries

conda create -n dackar_libs python=3.11

conda activate dackar_libs

pip install spacy==3.5 textacy matplotlib nltk coreferee beautifulsoup4 networkx pysbd tomli numerizer autocorrect pywsd openpyxl "quantulum3[classifier]" numpy==1.26 scikit-learn pyspellchecker contextualSpellCheck pandas
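
After the install finishes, it can be useful to confirm that every package actually landed in the active environment. The following is a minimal sketch, not part of DACKAR: the list `REQUIRED` is copied from the pip command above, and the helper name `find_missing` is hypothetical.

```python
from importlib import metadata

# Distribution names copied from the pip command above.
REQUIRED = [
    "spacy", "textacy", "matplotlib", "nltk", "coreferee", "beautifulsoup4",
    "networkx", "pysbd", "tomli", "numerizer", "autocorrect", "pywsd",
    "openpyxl", "quantulum3", "numpy", "scikit-learn", "pyspellchecker",
    "contextualSpellCheck", "pandas",
]


def find_missing(packages):
    """Return the names in `packages` that have no installed distribution."""
    missing = []
    for name in packages:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = find_missing(REQUIRED)
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

Run it inside the activated `dackar_libs` environment; any name it reports can be installed again with pip.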

Download the language models for spaCy and coreferee

python -m spacy download en_core_web_lg

python -m coreferee install en
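
To check that both downloads succeeded, one option is to look for the model package and then load it with the coreferee component attached. This is a sketch under assumptions: the helper names `model_installed` and `load_pipeline` are hypothetical, and the pipeline that DACKAR itself builds may differ.

```python
from importlib import util


def model_installed(package_name):
    """Return True if a top-level package can be located without importing it."""
    return util.find_spec(package_name) is not None


def load_pipeline(model_name="en_core_web_lg"):
    """Load the spaCy model and add coreferee's pipeline component."""
    # Imported lazily so that model_installed() works even without spaCy.
    import spacy
    import coreferee  # noqa: F401  (makes the "coreferee" factory available)

    nlp = spacy.load(model_name)
    nlp.add_pipe("coreferee")
    return nlp


if __name__ == "__main__":
    if model_installed("en_core_web_lg"):
        nlp = load_pipeline()
        print("Pipeline components:", nlp.pipe_names)
    else:
        print("en_core_web_lg is not installed")
```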

Download the NLTK data required for similarity analysis

python -m nltk.downloader all
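
Downloading `all` fetches the complete NLTK collection, which is large. As a lighter alternative, the sketch below fetches only the three resources that this guide names in its manual-install steps (wordnet, averaged_perceptron_tagger, punkt). Whether these three cover everything DACKAR needs is an assumption, and the function name `download_resources` and its `download` parameter are hypothetical.

```python
# Resource names taken from the manual-install steps in this guide.
RESOURCES = ["wordnet", "averaged_perceptron_tagger", "punkt"]


def download_resources(names=RESOURCES, download=None):
    """Download each NLTK resource and return the names that failed.

    `download` defaults to nltk.download, which returns True on success.
    A different callable can be passed in, for example when testing offline.
    """
    if download is None:
        import nltk  # imported lazily so the function can be tested offline

        download = nltk.download
    return [name for name in names if not download(name)]


if __name__ == "__main__":
    failed = download_resources()
    print("Failed downloads:", ", ".join(failed) if failed else "none")
```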

Alternative approach if the downloads fail with an SSLError

  1. Download en_core_web_lg-3.5.0.whl, then run:

python -m pip install ./en_core_web_lg-3.5.0.whl

  2. Download the coreferee model (coreferee_model_en.zip), then run:

python -m pip install ./coreferee_model_en.zip

  3. Run the script DACKAR/nltkDownloader.py to download the NLTK data:

python nltkDownloader.py

Alternatively, see installing_nltk_data for how to install the NLTK data manually. For this project, users can also try these steps:

cd ~
mkdir nltk_data
cd nltk_data
mkdir corpora
mkdir taggers
mkdir tokenizers
# Download wordnet, averaged_perceptron_tagger, and punkt, then copy them into place:
cp -r wordnet ~/nltk_data/corpora/
cp -r averaged_perceptron_tagger ~/nltk_data/taggers/
cp -r punkt ~/nltk_data/tokenizers/
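
The same directory layout can be created and checked from Python, which may be convenient on systems without a Unix shell. This is a minimal sketch: the mapping `LAYOUT` mirrors the `cp` commands above, and the helper names `make_nltk_dirs` and `missing_resources` are hypothetical. It only checks for unpacked resource directories and does not download anything.

```python
from pathlib import Path

# Resource name -> nltk_data subdirectory, as in the cp commands above.
LAYOUT = {
    "wordnet": "corpora",
    "averaged_perceptron_tagger": "taggers",
    "punkt": "tokenizers",
}


def make_nltk_dirs(root):
    """Create the subdirectories under `root`; return each resource's expected path."""
    root = Path(root)
    targets = {}
    for resource, subdir in LAYOUT.items():
        (root / subdir).mkdir(parents=True, exist_ok=True)
        targets[resource] = root / subdir / resource
    return targets


def missing_resources(root):
    """Return the sorted names of resources not yet copied under `root`."""
    targets = make_nltk_dirs(root)
    return sorted(name for name, path in targets.items() if not path.exists())


if __name__ == "__main__":
    print("Still missing:", missing_resources(Path.home() / "nltk_data"))
```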