Skip to content

Commit 65db5ab

Browse files
authored
Merge pull request #2975 from vinta/chore/cleanup
Cleanup: replace deprecated entries, fix categories, add new entries
2 parents 7b80024 + 79c0be0 commit 65db5ab

File tree

1 file changed

+19
-15
lines changed

1 file changed

+19
-15
lines changed

README.md

Lines changed: 19 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -101,7 +101,6 @@ _Libraries for administrative interfaces._
101101
- [flower](https://github.com/mher/flower) - Real-time monitor and web admin for Celery.
102102
- [func-to-web](https://github.com/offerrall/FuncToWeb) - Instantly create web UIs from Python functions using type hints. Zero frontend code required.
103103
- [jet-bridge](https://github.com/jet-admin/jet-bridge) - Admin panel framework for any application with nice UI (ex Jet Django).
104-
- [streamlit](https://github.com/streamlit/streamlit) - A framework which lets you build dashboards, generate reports, or create chat apps in minutes.
105104
- [wooey](https://github.com/wooey/wooey) - A Django app which creates automatic web UIs for Python scripts.
106105

107106
## Algorithms and Design Patterns
@@ -318,7 +317,6 @@ _Libraries for data analysis._
318317
- [aws-sdk-pandas](https://github.com/aws/aws-sdk-pandas) - Pandas on AWS.
319318
- [datasette](https://github.com/simonw/datasette) - An open source multi-tool for exploring and publishing data.
320319
- [desbordante](https://github.com/desbordante/desbordante-core/) - An open source data profiler for complex pattern discovery.
321-
- [docling](https://github.com/docling-project/docling) - Library for converting documents into structured data.
322320
- [optimus](https://github.com/hi-primus/optimus) - Agile Data Science Workflows made easy with PySpark.
323321
- [pandas](https://github.com/pandas-dev/pandas) - A library providing high-performance, easy-to-use data structures and data analysis tools.
324322
- [pathway](https://github.com/pathwaycom/pathway) - Real-time data processing framework for Python with reactive dataflows.
@@ -348,6 +346,7 @@ _Libraries for visualizing data. Also see [awesome-javascript](https://github.co
348346
- [matplotlib](https://github.com/matplotlib/matplotlib) - A Python 2D plotting library.
349347
- [plotly](https://github.com/plotly/plotly.py) - Interactive graphing library for Python.
350348
- [plotnine](https://github.com/has2k1/plotnine) - A grammar of graphics for Python based on ggplot2.
349+
- [streamlit](https://github.com/streamlit/streamlit) - A framework which lets you build dashboards, generate reports, or create chat apps in minutes.
351350
- [pygal](https://github.com/Kozea/pygal) - A Python SVG Charts Creator.
352351
- [pygraphviz](https://github.com/pygraphviz/pygraphviz/) - Python interface to [Graphviz](http://www.graphviz.org/).
353352
- [pyqtgraph](https://github.com/pyqtgraph/pyqtgraph) - Interactive and realtime 2D/3D/Image plotting and science/engineering widgets.
@@ -385,16 +384,15 @@ _Libraries for connecting and operating databases._
385384
- [kafka-python](https://github.com/dpkp/kafka-python) - The Python client for Apache Kafka.
386385
- [pymongo](https://github.com/mongodb/mongo-python-driver) - The official Python client for MongoDB.
387386
- [redis-py](https://github.com/redis/redis-py) - The Python client for Redis.
388-
- [Beanie](https://github.com/BeanieODM/beanie) - An asynchronous Python object-document mapper (ODM) for MongoDB.
389387

390388
## Date and Time
391389

392390
_Libraries for working with dates and times._
393391

392+
- [zoneinfo](https://docs.python.org/3/library/zoneinfo.html) - (Python standard library) IANA time zone support. Brings the [tz database](https://en.wikipedia.org/wiki/Tz_database) into Python.
394393
- [arrow](https://github.com/arrow-py/arrow) - A Python library that offers a sensible and human-friendly approach to creating, manipulating, formatting and converting dates, times and timestamps.
395394
- [dateutil](https://github.com/dateutil/dateutil) - Extensions to the standard Python [datetime](https://docs.python.org/3/library/datetime.html) module.
396395
- [pendulum](https://github.com/sdispater/pendulum) - Python datetimes made easy.
397-
- [pytz](https://github.com/stub42/pytz) - World timezone definitions, modern and historical. Brings the [tz database](https://en.wikipedia.org/wiki/Tz_database) into Python.
398396

399397
## Debugging Tools
400398

@@ -489,12 +487,14 @@ _Libraries for generating project documentation._
489487

490488
_Libraries for downloading._
491489

492-
- [akshare](https://github.com/jindaxiang/akshare) - A financial data interface library, built for human beings!
493-
- [edgartools](https://github.com/dgunning/edgartools) - Library for downloading structured data from SEC EDGAR filings and XBRL financial statements.
494-
- [OpenBB](https://github.com/OpenBB-finance/OpenBB) - A financial data platform for analysts, quants and AI agents.
495-
- [s3cmd](https://github.com/s3tools/s3cmd) - A command line tool for managing Amazon S3 and CloudFront.
496-
- [yfinance](https://github.com/ranaroussi/yfinance) - Easy Pythonic way to download market and financial data from Yahoo Finance.
497-
- [youtube-dl](https://github.com/ytdl-org/youtube-dl/) - A command-line program to download videos from YouTube and other video sites.
490+
- General
491+
- [s3cmd](https://github.com/s3tools/s3cmd) - A command line tool for managing Amazon S3 and CloudFront.
492+
- [youtube-dl](https://github.com/ytdl-org/youtube-dl/) - A command-line program to download videos from YouTube and other video sites.
493+
- Financial Data
494+
- [akshare](https://github.com/jindaxiang/akshare) - A financial data interface library, built for human beings!
495+
- [edgartools](https://github.com/dgunning/edgartools) - Library for downloading structured data from SEC EDGAR filings and XBRL financial statements.
496+
- [OpenBB](https://github.com/OpenBB-finance/OpenBB) - A financial data platform for analysts, quants and AI agents.
497+
- [yfinance](https://github.com/ranaroussi/yfinance) - Easy Pythonic way to download market and financial data from Yahoo Finance.
498498

499499
## Editor Plugins and IDEs
500500

@@ -537,7 +537,6 @@ _Libraries for Python version and virtual environment management._
537537

538538
_Libraries for file manipulation._
539539

540-
- [markitdown](https://github.com/microsoft/markitdown) - Python tool for converting files and office documents to Markdown.
541540
- [mimetypes](https://docs.python.org/3/library/mimetypes.html) - (Python standard library) Map filenames to MIME types.
542541
- [path.py](https://github.com/jaraco/path.py) - A module wrapper for [os.path](https://docs.python.org/3/library/os.path.html).
543542
- [pathlib](https://docs.python.org/3/library/pathlib.html) - (Python standard library) A cross-platform, object-oriented path library.
@@ -796,6 +795,7 @@ _Libraries that implement Object-Relational Mapping or data mapping techniques._
796795
- [pydal](https://github.com/web2py/pydal/) - A pure Python Database Abstraction Layer.
797796
- [SQLModel](https://github.com/fastapi/sqlmodel) - SQLModel is based on Python type annotations, and powered by Pydantic and SQLAlchemy.
798797
- NoSQL Databases
798+
- [Beanie](https://github.com/BeanieODM/beanie) - An asynchronous Python object-document mapper (ODM) for MongoDB.
799799
- [mongoengine](https://github.com/MongoEngine/mongoengine) - A Python Object-Document-Mapper for working with MongoDB.
800800
- [ODMantic](https://github.com/art049/odmantic) - Sync and Async ODM for MongoDB built on top of Pydantic for model definition and validation.
801801
- [PynamoDB](https://github.com/pynamodb/PynamoDB) - A Pythonic interface for [Amazon DynamoDB](https://aws.amazon.com/dynamodb/).
@@ -970,8 +970,10 @@ _Shells built with Python._
970970
_Libraries for parsing and manipulating specific text formats._
971971

972972
- General
973+
- [docling](https://github.com/docling-project/docling) - Library for converting documents into structured data.
973974
- [kreuzberg](https://github.com/kreuzberg-dev/kreuzberg) - High-performance document extraction library with a Rust core, supporting 62+ formats including PDF, Office, images with OCR, HTML, email, and archives.
974975
- [tablib](https://github.com/jazzband/tablib) - A module for Tabular Datasets in XLS, CSV, JSON, YAML.
976+
- [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word, PowerPoint, PDFs, etc.
975977
- Office
976978
- [docxtpl](https://github.com/elapouya/python-docx-template) - Editing a docx document by jinja2 template
977979
- [openpyxl](https://openpyxl.readthedocs.io/en/stable/) - A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
@@ -987,11 +989,14 @@ _Libraries for parsing and manipulating specific text formats._
987989
- [PyPDF2](https://github.com/mstamy2/PyPDF2) - A library capable of splitting, merging and transforming PDF pages.
988990
- [ReportLab](https://www.reportlab.com/opensource/) - Allowing Rapid creation of rich PDF documents.
989991
- Markdown
990-
- [Jimmy](https://github.com/marph91/jimmy) - Convert your notes to Markdown.
991-
- [Mistune](https://github.com/lepture/mistune) - Fastest and full featured pure Python parsers of Markdown.
992-
- [Python-Markdown](https://github.com/waylan/Python-Markdown) - A Python implementation of John Gruber’s Markdown.
992+
- [markdown-it-py](https://github.com/executablebooks/markdown-it-py) - Markdown parser with 100% CommonMark support, extensions, and syntax plugins.
993+
- [markdown](https://github.com/waylan/Python-Markdown) - A Python implementation of John Gruber’s Markdown.
994+
- [markitdown](https://github.com/microsoft/markitdown) - Python tool for converting files and office documents to Markdown.
995+
- [mistune](https://github.com/lepture/mistune) - Fastest and full featured pure Python parsers of Markdown.
993996
- YAML
994997
- [PyYAML](https://github.com/yaml/pyyaml) - YAML implementations for Python.
998+
- TOML
999+
- [tomllib](https://docs.python.org/3/library/tomllib.html) - (Python standard library) Parse TOML files.
9951000
- CSV
9961001
- [csvkit](https://github.com/wireservice/csvkit) - Utilities for converting to and working with CSV.
9971002
- Archive
@@ -1132,7 +1137,6 @@ _Libraries for extracting web contents._
11321137
- [python-readability](https://github.com/buriy/python-readability) - Fast Python port of arc90's readability tool.
11331138
- [requests-html](https://github.com/psf/requests-html) - Pythonic HTML Parsing for Humans.
11341139
- [sumy](https://github.com/miso-belica/sumy) - A module for automatic summarization of text documents and HTML pages.
1135-
- [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word, PowerPoint, PDFs, etc.
11361140
- [toapi](https://github.com/gaojiuli/toapi) - Every web site provides APIs.
11371141

11381142
## Web Crawling

0 commit comments

Comments
 (0)