Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
Name: python3-html-text | Distribution: Fedora Project |
Version: 0.6.2 | Vendor: Fedora Project |
Release: 6.fc43 | Build date: Sat Sep 20 00:06:25 2025 |
Group: Unspecified | Build host: buildhw-x86-03.rdu3.fedoraproject.org |
Size: 30889 | Source RPM: python-html-text-0.6.2-6.fc43.src.rpm |
Packager: Fedora Project | |
Url: https://github.com/zytedata/html-text | |
Summary: Extract text from HTML |
How is html_text different from .xpath('//text()') from LXML or .get_text() from Beautiful Soup? - Text extracted with html_text does not contain inline styles, javascript, comments and other text that is not normally visible to users; - html_text normalizes whitespace, but in a way smarter than .xpath('normalize-space()), adding spaces around inline elements (which are often used as block elements in html markup), and trying to avoid adding extra spaces for punctuation; - html-text can add newlines (e.g. after headers or paragraphs), so that the output text looks more like how it is rendered in browsers.
MIT
* Fri Sep 19 2025 Python Maint <python-maint@redhat.com> - 0.6.2-6 - Rebuilt for Python 3.14.0rc3 bytecode * Fri Aug 15 2025 Python Maint <python-maint@redhat.com> - 0.6.2-5 - Rebuilt for Python 3.14.0rc2 bytecode * Fri Jul 25 2025 Fedora Release Engineering <releng@fedoraproject.org> - 0.6.2-4 - Rebuilt for https://fedoraproject.org/wiki/Fedora_43_Mass_Rebuild * Tue Jun 03 2025 Python Maint <python-maint@redhat.com> - 0.6.2-3 - Rebuilt for Python 3.14 * Sat Jan 18 2025 Fedora Release Engineering <releng@fedoraproject.org> - 0.6.2-2 - Rebuilt for https://fedoraproject.org/wiki/Fedora_42_Mass_Rebuild * Fri Oct 18 2024 Benson Muite <benson_muite@emailplus.org> - 0.6.2-1 - Initial packaging
/usr/lib/python3.14/site-packages/html_text /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/INSTALLER /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/METADATA /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/WHEEL /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/licenses /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/licenses/LICENSE /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/top_level.txt /usr/lib/python3.14/site-packages/html_text/__init__.py /usr/lib/python3.14/site-packages/html_text/__pycache__ /usr/lib/python3.14/site-packages/html_text/__pycache__/__init__.cpython-314.opt-1.pyc /usr/lib/python3.14/site-packages/html_text/__pycache__/__init__.cpython-314.pyc /usr/lib/python3.14/site-packages/html_text/__pycache__/html_text.cpython-314.opt-1.pyc /usr/lib/python3.14/site-packages/html_text/__pycache__/html_text.cpython-314.pyc /usr/lib/python3.14/site-packages/html_text/html_text.py /usr/share/doc/python3-html-text /usr/share/doc/python3-html-text/README.rst
Generated by rpm2html 1.8.1
Fabrice Bellet, Fri Oct 24 00:14:14 2025