This repository contains slides and code samples for using Python to implement some NLP related tasks.
The repository includes the implementation of the following parts:
- Regular Expressions (RegEx)
- Text Tokenization
- Text Processing and Visualization
- Gensim Text Processing
- Named Entity Recognition (NER)
Note
You can either follow the steps below for local installation or use the provided Docker image for a containerized environment.
These commands will set up an isolated environment and install all required packages for this project.
uv sync # installs all dependenciesCodes run on top of a Docker image, ensuring a consistent and reproducible environment.
Important
You will need to have Docker installed on your machine. You can download it from the Docker website.
To run the code, you will need to first pull the Docker image by running the following command:
docker pull abmhamdi/nlpThis may take a while, as it will download and install all necessary dependencies.
docker compose up -dstarts the container in detached modedocker compose downtears down the container
Services can be run by typing the command docker compose up. This will start the Marimo on http://localhost:1357, and you should be able to use Python from within the notebook by starting a new Python notebook.
This project is licensed under the MIT License - see the LICENSE file for details.