diff options
author | June McEnroe <june@causal.agency> | 2021-09-07 16:53:43 -0400 |
---|---|---|
committer | June McEnroe <june@causal.agency> | 2021-09-07 16:53:43 -0400 |
commit | f5235d92eeb7477fa459b7f5665873c0c23452ff (patch) | |
tree | 97d748190cdbd79f1690a22527d527e9e778b462 /bin/man1 | |
parent | Show about path in page title (diff) | |
download | src-f5235d92eeb7477fa459b7f5665873c0c23452ff.tar.gz src-f5235d92eeb7477fa459b7f5665873c0c23452ff.zip |
Add dehtml
Diffstat (limited to 'bin/man1')
-rw-r--r-- | bin/man1/dehtml.1 | 35 |
1 files changed, 35 insertions, 0 deletions
diff --git a/bin/man1/dehtml.1 b/bin/man1/dehtml.1 new file mode 100644 index 00000000..a0c5a8c4 --- /dev/null +++ b/bin/man1/dehtml.1 @@ -0,0 +1,35 @@ +.Dd September 7, 2021 +.Dt DEHTML 1 +.Os +. +.Sh NAME +.Nm dehtml +.Nd extract text from HTML +. +.Sh SYNOPSIS +.Nm +.Op Fl s +.Op Ar +. +.Sh DESCRIPTION +The +.Nm +utility extracts text +from HTML documents. +Text inside +.Sy <title> , +.Sy <style> +and +.Sy <script> +tags is discarded. +Numeric and common named HTML entities +are converted. +. +.Pp +The arguments are as follows: +.Bl -tag -width Ds +.It Fl s +Collapse whitespace outside of +.Sy <pre> +tags. +.El |