webdump

Text-based web client/page dump (experiment)
git clone git://git.codemadness.org/webdump
Log | Files | Refs | README | LICENSE

DateCommit messageAuthorFiles+-
2023-03-20 20:32white-space and wrapping improvementsHiltjo Posthuma1+45-29
2023-03-20 18:54use ctype-like macros, separate print functionsHiltjo Posthuma1+85-37
2023-03-20 18:54xml.c: fix unitialized use of variable c in HTML_MODEHiltjo Posthuma1+6-1
2023-03-19 11:25xml.c: fix-up: don't process pending data twiceHiltjo Posthuma1+4-0
2023-03-19 11:20bump LICENSE yearHiltjo Posthuma1+1-1
2023-03-19 11:16xml.c: some HTML-specific changesHiltjo Posthuma1+17-0
2022-10-06 22:11sync improvements from xmlparserHiltjo Posthuma2+24-20
2022-07-31 10:17bump LICENSE yearHiltjo Posthuma1+1-1
2022-07-31 10:16add BSD err() and errx() for portabilityHiltjo Posthuma1+44-1
2021-10-26 00:05don't try to make absolute URIs absoluteHiltjo Posthuma1+3-2
2021-10-26 00:03small code-style change, unnecesary condition, change to elseHiltjo Posthuma1+3-2
2021-10-25 23:55remove old parseuri() functionHiltjo Posthuma1+0-79
2021-10-25 23:46README: small rewording and add line for the mutt mail clientHiltjo Posthuma1+7-3
2021-10-25 21:45Makefile: use -D_DEFAULT_SOURCE, prevents warnings on some systemsHiltjo Posthuma1+1-1
2021-10-25 21:45enable and fix xmlattrstart handler and parsing src, hrefHiltjo Posthuma1+6-2
2021-10-25 21:42add support for parsing <base href="" />Hiltjo Posthuma1+19-5
2021-10-25 21:42improve man pageHiltjo Posthuma1+16-7
2021-06-27 13:26webdump.1: bump dateHiltjo Posthuma1+1-1
2021-06-27 13:24add -x option to write eXtracted resources to fd 3 in the TSV formatHiltjo Posthuma2+11-2
2021-06-27 13:23sync improved URI parser and formatter from sfeedHiltjo Posthuma1+224-104
2021-06-27 13:22add thinsp to short/common namedentities listHiltjo Posthuma1+1-0
2021-06-27 13:21README: add mailcap example for muttHiltjo Posthuma1+13-0
2021-06-27 13:21LICENSE: bump yearHiltjo Posthuma1+1-1
2020-07-06 17:00sync getnext_ignore fix from grabtitle programHiltjo Posthuma1+1-1
2020-07-06 16:58improve parsing of links with attribute entititiesHiltjo Posthuma1+56-29
2020-05-22 23:04namedentities.h: add a few entities to the common listHiltjo Posthuma1+2-0
2020-03-12 21:33handle closed tag with no corresponding open tag (for malformed HTML)Leonardo Taccari1+4-2
2020-03-12 08:30mini-consistency fixHiltjo Posthuma1+1-1
2020-03-11 14:46improve XML entity conversionHiltjo Posthuma1+8-8
2020-03-11 14:44add initial "optional" tag handling, rework some tag handlingHiltjo Posthuma1+177-103
2019-12-09 23:24use field isvoid from the looked up tag (should be the same)Hiltjo Posthuma1+1-1
2019-12-09 23:22rm unused variableHiltjo Posthuma1+0-2
2019-12-09 23:13hide tags if parent is hiddenHiltjo Posthuma1+9-3
2019-12-09 18:05improve hidden elements, add "hidden" and "aria-hidden" attribute parsingHiltjo Posthuma1+13-3
2019-12-09 18:05refactor bsearch to findtag()Hiltjo Posthuma1+12-5
2019-12-09 18:04rename autoclose to isvoid (void element)Hiltjo Posthuma1+11-4
2019-12-09 17:46hide script, style and template tagHiltjo Posthuma1+3-0
2019-12-09 17:46use binary search for tagsHiltjo Posthuma1+18-8
2019-12-09 17:38sort tagsHiltjo Posthuma1+50-60
2019-12-08 15:41use binary search (bsearch) for named entity lookup on the sorted listHiltjo Posthuma1+23-10
2019-12-08 15:31sort named entities, add laquo, raquo and REG to the common-used listHiltjo Posthuma2+1977-1975
2019-12-08 15:23show img alt textHiltjo Posthuma1+9-0
2019-12-08 15:20README: minor rewordHiltjo Posthuma1+2-2
2019-12-08 15:09simplify link reference type, remove ifdefs and some debug codeHiltjo Posthuma1+1-30
2019-12-08 14:57only show link references at the bottom if there are any, remove #ifdef for itHiltjo Posthuma1+3-9
2019-11-29 14:25add tel: as link reference, only mailto: and tel: may be used in <a>Hiltjo Posthuma1+7-3
2019-11-29 14:22add a case for mailto: urls for link referencesHiltjo Posthuma1+7-2
2019-11-29 14:08update READMEHiltjo Posthuma1+2-0
2019-11-29 14:06disable ANSI codes by default now that there is a -a optionHiltjo Posthuma1+1-1
2019-11-29 14:05underline references/linksHiltjo Posthuma1+4-2
2019-11-29 14:05update READMEHiltjo Posthuma1+34-0
2019-11-29 14:04improvem man page, change -s to -rHiltjo Posthuma2+15-10
2019-11-29 09:49Add ability to toggle rendering optionsWilly Goiffon2+20-1
2019-11-29 09:42Ensure no unnecessary newline is appended at end of dumpWilly Goiffon1+3-1
2019-11-28 21:23named entities: add &reg;Hiltjo Posthuma1+1-0
2019-11-28 21:23update READMEHiltjo Posthuma1+19-5
2019-11-28 21:22"autoclose" certain tags if they are not defined in the short formHiltjo Posthuma1+21-3
2019-11-28 18:27use a list of common-used named entitiesHiltjo Posthuma3+18-2014
2019-11-28 18:18remove local TODOHiltjo Posthuma1+0-60
2019-11-28 18:17don't hide data inside unknown tags, just show it as inlineHiltjo Posthuma1+1-1
2019-11-28 18:16fix formatting inside <pre> containing markupHiltjo Posthuma1+16-7
2019-11-22 12:15disable reverse for <code>Hiltjo Posthuma1+1-1
2019-11-22 12:14add strikethrough, blink. refactor bitmasks for displaytypesHiltjo Posthuma1+39-32
2019-11-22 12:13add all named entities.Hiltjo Posthuma3+2035-19
2019-11-22 12:12add <br> handling in tagstartHiltjo Posthuma1+2-6
2019-11-22 12:12print and reset each style (do not reset all attributes)Hiltjo Posthuma1+27-12
2019-11-22 12:11add one printansi function for printing ANSI codes or notHiltjo Posthuma1+9-1
2019-11-20 23:43print reset sequence first, some small cleanupsHiltjo Posthuma1+6-10
2019-11-20 23:38show usage on invalid flag (or -h)Hiltjo Posthuma1+4-2
2019-11-20 23:37update TODOHiltjo Posthuma1+4-17
2019-11-20 23:37webdump.1: minimal documentation for flagsHiltjo Posthuma1+2-1
2019-11-20 23:36make table cell header (th) and headers boldHiltjo Posthuma1+7-7
2019-11-20 23:36add arg.h and option parsing: -b and -wHiltjo Posthuma3+62-3
2019-11-20 23:24make (soft)line-wrapping an option, still WIP thoughHiltjo Posthuma1+8-8
2019-11-20 23:18define some run-time options, change some optionsHiltjo Posthuma2+33-14
2019-11-20 23:03improve table row and cell rendering a bitHiltjo Posthuma1+2-3
2019-11-20 22:57add formatting improvements from z3braHiltjo Posthuma1+4-4
2019-11-20 22:49rm TODO commentHiltjo Posthuma1+0-4
2019-11-20 22:48Translate semantic tags to ANSI escapesHiltjo Posthuma1+18-5
2019-11-20 22:30disable (soft)-line wrappingHiltjo Posthuma1+7-4
2019-11-20 22:29refactor nbsp handling and named entities to codepointsHiltjo Posthuma2+24-23
2019-11-20 17:29rename count to nchildren, some cleanupsHiltjo Posthuma1+6-13
2019-11-20 17:24improvementsHiltjo Posthuma3+205-88
2019-11-10 19:55Makefile: remove config.mk, simplifyHiltjo Posthuma2+15-22
2019-09-23 07:38update TODOHiltjo Posthuma1+2-0
2019-09-22 18:16man page: remove copy-pasta line, prefix "DEBUG:" for debug lineHiltjo Posthuma2+2-3
2019-09-22 17:54add TODO itemHiltjo Posthuma1+1-0
2019-09-22 17:48fix bug in ignoring character in <style> or <script>Hiltjo Posthuma1+1-1
2019-09-22 17:45absolute link references, add base href argument to programHiltjo Posthuma1+26-27
2019-09-22 17:19use cflags, ldflags, fix directory name (sfeed copy-pasta)Hiltjo Posthuma2+3-3
2019-09-22 17:17fix make distHiltjo Posthuma1+1-2
2019-09-22 17:16add bare minimum man page for nowHiltjo Posthuma1+21-0
2019-09-22 17:14initial MakefileHiltjo Posthuma2+106-3
2019-09-22 17:14rename main.c to webdump.cHiltjo Posthuma2+706-697
2019-09-22 17:10disable display inline (#ifdef), its the default type nowHiltjo Posthuma1+2-0
2019-09-21 18:02testing improve white-space handlingHiltjo Posthuma3+45-21
2019-09-21 17:14hide tag if DisplayNone, separate white-space handlingHiltjo Posthuma2+10-11
2019-09-21 14:43show "code" as pre for now, disable showing links for nowHiltjo Posthuma1+3-1
2019-09-21 14:37add a few HTML5 tags, update TODO, tiny cleanupHiltjo Posthuma2+12-3
2019-09-21 14:31add strlcat, strlcpy, update README, TODO and display "code" inlineHiltjo Posthuma5+112-4
24 more commits remaining, fetch the repository