| Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
| Name: python3-html-text | Distribution: Fedora Project |
| Version: 0.6.2 | Vendor: Fedora Project |
| Release: 6.fc43 | Build date: Sat Sep 20 00:06:25 2025 |
| Group: Unspecified | Build host: buildhw-x86-03.rdu3.fedoraproject.org |
| Size: 30889 | Source RPM: python-html-text-0.6.2-6.fc43.src.rpm |
| Packager: Fedora Project | |
| Url: https://github.com/zytedata/html-text | |
| Summary: Extract text from HTML | |
How is html_text different from .xpath('//text()') from LXML
or .get_text() from Beautiful Soup?
- Text extracted with html_text does not contain inline styles,
javascript, comments and other text that is not normally visible
to users;
- html_text normalizes whitespace, but in a way smarter than
.xpath('normalize-space()), adding spaces around inline elements
(which are often used as block elements in html markup), and trying
to avoid adding extra spaces for punctuation;
- html-text can add newlines (e.g. after headers or paragraphs), so
that the output text looks more like how it is rendered in browsers.
MIT
* Fri Sep 19 2025 Python Maint <python-maint@redhat.com> - 0.6.2-6 - Rebuilt for Python 3.14.0rc3 bytecode * Fri Aug 15 2025 Python Maint <python-maint@redhat.com> - 0.6.2-5 - Rebuilt for Python 3.14.0rc2 bytecode * Fri Jul 25 2025 Fedora Release Engineering <releng@fedoraproject.org> - 0.6.2-4 - Rebuilt for https://fedoraproject.org/wiki/Fedora_43_Mass_Rebuild * Tue Jun 03 2025 Python Maint <python-maint@redhat.com> - 0.6.2-3 - Rebuilt for Python 3.14 * Sat Jan 18 2025 Fedora Release Engineering <releng@fedoraproject.org> - 0.6.2-2 - Rebuilt for https://fedoraproject.org/wiki/Fedora_42_Mass_Rebuild * Fri Oct 18 2024 Benson Muite <benson_muite@emailplus.org> - 0.6.2-1 - Initial packaging
/usr/lib/python3.14/site-packages/html_text /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/INSTALLER /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/METADATA /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/WHEEL /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/licenses /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/licenses/LICENSE /usr/lib/python3.14/site-packages/html_text-0.6.2.dist-info/top_level.txt /usr/lib/python3.14/site-packages/html_text/__init__.py /usr/lib/python3.14/site-packages/html_text/__pycache__ /usr/lib/python3.14/site-packages/html_text/__pycache__/__init__.cpython-314.opt-1.pyc /usr/lib/python3.14/site-packages/html_text/__pycache__/__init__.cpython-314.pyc /usr/lib/python3.14/site-packages/html_text/__pycache__/html_text.cpython-314.opt-1.pyc /usr/lib/python3.14/site-packages/html_text/__pycache__/html_text.cpython-314.pyc /usr/lib/python3.14/site-packages/html_text/html_text.py /usr/share/doc/python3-html-text /usr/share/doc/python3-html-text/README.rst
Generated by rpm2html 1.8.1
Fabrice Bellet, Fri Oct 24 00:01:23 2025