Proposal for "HTML_Safe"

» Metadata	» Status
Category: HTML Proposer: Roman Ivanov License: BSD, can be changed to PHP	Status: Finished Result: Accepted Sum of Votes: 11 (1 conditional) Search registered package
» Description
This parser strips down all potentially dangerous content within HTML: opening tag without its closing tag closing tag without its opening tag any of these tags: â€śbaseâ€ť, â€śbasefontâ€ť, â€śheadâ€ť, â€śhtmlâ€ť, â€śbodyâ€ť, â€śappletâ€ť, â€śobjectâ€ť, â€śiframeâ€ť, â€śframeâ€ť, â€śframesetâ€ť, â€śscriptâ€ť, â€ślayerâ€ť, â€śilayerâ€ť, â€śembedâ€ť, â€śbgsoundâ€ť, â€ślinkâ€ť, â€śmetaâ€ť, â€śstyleâ€ť, â€śtitleâ€ť, â€śblinkâ€ť, â€śxmlâ€ť etc. any of these attributes: on, data, dynsrc javascript:/vbscript:/about: etc. protocols expression/behavior etc. in styles any other active content It also tries to convert code to XHTML valid, but htmltidy is far better solution for this task. Advantages comparing to strip_tags: 1. strip_tags works on white-list basis, deleting all tags except allowed. HTML_Safe works on black-list basis, deleting only dangerous content. 2. strip_tags can only strip tags. HTML_safe strips down all active content, including tags, attributes and values of atrributes. 3. strip_tags is not intended to fight XSS. HTML_Safe has primary goal to prevent any XSS attack. 4. strip_tags does not try to produce XHTML compliant code. It does not close unclosed tags. HTML_Safe is successor of SafeHTML project. HTML_Safe fixes all known issues with SafeHTML.
» Dependencies	» Links
XML_HTMLSax3	Package source file (.phps/.htm) Package example (.php)
» Timeline	» Changelog
First Draft: 2005-01-29 Proposal: 2005-01-29 Call for Votes: 2005-02-06	Roman Ivanov [2005-01-30 18:01 UTC] Description updated: added comparison with strip_tags() function. Roman Ivanov [2005-02-06 16:33 UTC] Description updated: relationship with SafeHTML clarified. Code updated: now it seems to be fully compatible with PEAR Coding Standards. Thies C. Arntzen [2023-09-02 18:37 UTC]