PyRank

HTML Parser Python Packages

Python packages with the GitHub topic html-parser. Sorted by relevance, with stars and monthly downloads.
miso-belica
justext

Heuristic-based boilerplate removal tool

6.5M 819 89
alphanome-ai
sec-parser

Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual (semantic) structure of the document.

77K 285 79
kata198
advancedhtmlparser

Fast indexed Python HTML parser that builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modification, and formatting. Also supports XPath.

7K 101 25
bug-ops
fast-scrape

🦀 High-performance HTML parsing library. Rust core with native bindings for Python, Node.js & WASM. SIMD-accelerated, memory-safe, consistent API everywhere.

7K 5 0
rajatomar788
pywebcopy

Saves webpages locally to your hard disk with images, CSS, JS, and links as is.

6K 639 117
imgurbot12
pyxml3

Pure Python 3 alternative to stdlib xml.etree with HTML support

4K 1 1
ispras
dedoc

Extract content and logical tree structure from textual documents

2K 681 56
OwenOrcan
yirabot

YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, offering command-line ease and Python integration. Ideal for research, SEO, and data collection.

1K 17 0
Bystroushaak
pydhtmlparser

Python HTML/XML parser for easy web scraping.

307 6 3
lexndru
hap

Hap! is an HTML parser and scraping tool.

296 1 0
jet-logic
alterx

A powerful file processing toolkit for batch transformations of HTML, JSON, TOML, XML, and YAML files

224 0 0
yogendratamang48
parse-utils

Page Parser Utils For scraping, List index update

212 2 0
luxcem
apifier

Apifier is a very simple HTML parser written in Python based on CSS selectors

210 6 1
esign-consulting
qarsmac

Air quality data collected from the Rio de Janeiro city government (Prefeitura do RJ), Secretaria Municipal de Meio Ambiente (SMAC).

199 0 0
yannickperrenet
bookmarkdown

Parse your browser's exported HTML bookmark file to Markdown.

192 18 0
sihaelov
harser

An easy way to parse HTML and build XPath expressions

174 135 3
MaksimJames
pyhtmltext

pyhtmltext is a useful and flexible tool for extracting text from HTML.

158 1 0
kurtnettle
bubt-routinepy

An unofficial Python wrapper of the BUBT Routine API + a robust web scraper and PDF extractor for getting routine data.

102 0 0
yogendratamang48
parse-utils-yogen48

Easy HTML/JSON parser for web scraping

97 2 0
vincentlaucsb
pgreaper

A Python library for loading data from various formats into PostgreSQL databases.

83 12 1
Anikeshpatel
dompy-parser

JavaScript DOM API for Python: an HTML parser and web scraping tool in Python

70 3 0
invanatech
webpage-reader

Reads a webpage and extracts information such as SEO tags, headings, and URLs, based on HTML5 tags and standard styling frameworks

49 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery