tscrape, branch HEADtwitter scraper (not working anymore)
291870afea3ad56366e82efaceef0f1288b340182021-07-20T13:20:17Z2021-07-20T13:20:17ZREADME: small improvementHiltjo Posthumahiltjo@codemadness.orgcommit 291870afea3ad56366e82efaceef0f1288b34018
parent f0b2764a08ed2e73dc098f7b4792b81284624944
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Tue, 20 Jul 2021 15:20:17 +0200
README: small improvement
f0b2764a08ed2e73dc098f7b4792b812846249442021-07-20T12:49:43Z2021-07-20T12:49:43ZREADME: fix twitter to Atom (sfeed) exampleHiltjo Posthumahiltjo@codemadness.orgcommit f0b2764a08ed2e73dc098f7b4792b81284624944
parent ca46299e72f8e28b0f9211b55ccc65e62eaa9306
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Tue, 20 Jul 2021 14:49:43 +0200
README: fix twitter to Atom (sfeed) example
Thanks Nathaniel for reporting it!
ca46299e72f8e28b0f9211b55ccc65e62eaa93062021-03-20T14:53:18Z2021-03-20T14:53:18Zbump LICENSE yearHiltjo Posthumahiltjo@codemadness.orgcommit ca46299e72f8e28b0f9211b55ccc65e62eaa9306
parent 34ed10b505310477e31b608a28470e2e997eb4ea
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 20 Mar 2021 15:53:18 +0100
bump LICENSE year
34ed10b505310477e31b608a28470e2e997eb4ea2021-03-19T10:25:29Z2021-03-19T10:25:29Zupdate static bearer tokenHiltjo Posthumahiltjo@codemadness.orgcommit 34ed10b505310477e31b608a28470e2e997eb4ea
parent 76a1794bec9f2e64572537abce0258827ce93a80
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 19 Mar 2021 11:25:29 +0100
update static bearer token
This uses the same bearer as youtube-dl.
Thanks frobnitz!
76a1794bec9f2e64572537abce0258827ce93a802020-10-01T18:23:15Z2020-10-01T18:23:15Zbump version to 0.6Hiltjo Posthumahiltjo@codemadness.orgcommit 76a1794bec9f2e64572537abce0258827ce93a80
parent cdfc383b166a112646ab21596442d8fb976bc311
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Thu, 1 Oct 2020 20:23:15 +0200
bump version to 0.6
cdfc383b166a112646ab21596442d8fb976bc3112020-10-01T18:22:18Z2020-10-01T18:22:18Ztscrape: remove debugging commentsHiltjo Posthumahiltjo@codemadness.orgcommit cdfc383b166a112646ab21596442d8fb976bc311
parent acef6f3969b21a48512c389febf2e6ca9089ebfc
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Thu, 1 Oct 2020 20:22:18 +0200
tscrape: remove debugging comments
acef6f3969b21a48512c389febf2e6ca9089ebfc2020-10-01T18:21:44Z2020-10-01T18:21:44Ztscraperc.example: change feedsHiltjo Posthumahiltjo@codemadness.orgcommit acef6f3969b21a48512c389febf2e6ca9089ebfc
parent dc41402d290af10b6695824342339cba108cc6c0
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Thu, 1 Oct 2020 20:21:44 +0200
tscraperc.example: change feeds
dc41402d290af10b6695824342339cba108cc6c02020-10-01T18:10:06Z2020-10-01T18:10:06Ztscrape_update: escape { and } in sed expressionHiltjo Posthumahiltjo@codemadness.orgcommit dc41402d290af10b6695824342339cba108cc6c0
parent 0ec2e56f6971b7e33e84fc35aeaa796f9044554e
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Thu, 1 Oct 2020 20:10:06 +0200
tscrape_update: escape { and } in sed expression
Thanks anthk for reporting it.
0ec2e56f6971b7e33e84fc35aeaa796f9044554e2020-06-19T12:05:06Z2020-06-19T12:05:06Ztscrape_update: increase timeline limit from 50 to 100Hiltjo Posthumahiltjo@codemadness.orgcommit 0ec2e56f6971b7e33e84fc35aeaa796f9044554e
parent ff8d2ecaed4cb56e6cc1ccdc4a43e1a3e45eb61f
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 19 Jun 2020 14:05:06 +0200
tscrape_update: increase timeline limit from 50 to 100
ff8d2ecaed4cb56e6cc1ccdc4a43e1a3e45eb61f2020-06-06T22:08:42Z2020-06-06T22:08:42Zrefactor urls into general replacement function and replace some HTML entitiesHiltjo Posthumahiltjo@codemadness.orgcommit ff8d2ecaed4cb56e6cc1ccdc4a43e1a3e45eb61f
parent 62a3853f6428208e5be727175479ebcede127497
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 7 Jun 2020 00:08:42 +0200
refactor urls into general replacement function and replace some HTML entities
62a3853f6428208e5be727175479ebcede1274972020-06-06T10:51:50Z2020-06-06T10:51:50ZREADME: some rewordingHiltjo Posthumahiltjo@codemadness.orgcommit 62a3853f6428208e5be727175479ebcede127497
parent 5ca572b963ed7b64485d016eec950ca3a646d107
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 6 Jun 2020 12:51:50 +0200
README: some rewording
5ca572b963ed7b64485d016eec950ca3a646d1072020-06-06T10:47:51Z2020-06-06T10:47:51Zrm code thats not needed anymore and a debug/test commentHiltjo Posthumahiltjo@codemadness.orgcommit 5ca572b963ed7b64485d016eec950ca3a646d107
parent 86910f72369e655ce2db017e64646def543627b9
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 6 Jun 2020 12:47:51 +0200
rm code thats not needed anymore and a debug/test comment
86910f72369e655ce2db017e64646def543627b92020-06-06T00:23:37Z2020-06-06T10:40:37ZExpand all photos URLsLeonardo Taccariiamleot@gmail.comcommit 86910f72369e655ce2db017e64646def543627b9
parent 9cc1b7e985affd8764385d3d6cf6476804230cdd
Author: Leonardo Taccari <iamleot@gmail.com>
Date: Sat, 6 Jun 2020 02:23:37 +0200
Expand all photos URLs
9cc1b7e985affd8764385d3d6cf6476804230cdd2020-06-05T23:50:22Z2020-06-06T10:40:37ZExpand extended_entities URLs in retweetsLeonardo Taccariiamleot@gmail.comcommit 9cc1b7e985affd8764385d3d6cf6476804230cdd
parent d270b9dc10bc3d05b096f2dd34256dc9b962b951
Author: Leonardo Taccari <iamleot@gmail.com>
Date: Sat, 6 Jun 2020 01:50:22 +0200
Expand extended_entities URLs in retweets
d270b9dc10bc3d05b096f2dd34256dc9b962b9512020-06-05T23:40:36Z2020-06-06T10:40:37ZUse extended_entities instead of entitiesLeonardo Taccariiamleot@gmail.comcommit d270b9dc10bc3d05b096f2dd34256dc9b962b951
parent e2c4c24378d937edd6f9d717267d9f08b268df78
Author: Leonardo Taccari <iamleot@gmail.com>
Date: Sat, 6 Jun 2020 01:40:36 +0200
Use extended_entities instead of entities
Possible medias could be omitted in entities but present in
extended_entities.
e2c4c24378d937edd6f9d717267d9f08b268df782020-06-05T23:11:57Z2020-06-06T10:40:37ZFurther expand URLs in retweetsLeonardo Taccariiamleot@gmail.comcommit e2c4c24378d937edd6f9d717267d9f08b268df78
parent d204e2373cc9f7e3f3afa3d4f2afb7976f67b4ae
Author: Leonardo Taccari <iamleot@gmail.com>
Date: Sat, 6 Jun 2020 01:11:57 +0200
Further expand URLs in retweets
d204e2373cc9f7e3f3afa3d4f2afb7976f67b4ae2020-06-05T15:19:39Z2020-06-05T15:19:39Zupdate man pages a bit for the recent changesHiltjo Posthumahiltjo@codemadness.orgcommit d204e2373cc9f7e3f3afa3d4f2afb7976f67b4ae
parent 7ed31cfe093fa52711f0bdd407abc1e4d67f11b3
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 5 Jun 2020 17:19:39 +0200
update man pages a bit for the recent changes
7ed31cfe093fa52711f0bdd407abc1e4d67f11b32020-06-05T15:18:40Z2020-06-05T15:18:40Zreplace all whitespace by a single spaceHiltjo Posthumahiltjo@codemadness.orgcommit 7ed31cfe093fa52711f0bdd407abc1e4d67f11b3
parent d4929d30226753c84a2ab5d8b703dd7df69eb4c1
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 5 Jun 2020 17:18:40 +0200
replace all whitespace by a single space
d4929d30226753c84a2ab5d8b703dd7df69eb4c12020-06-05T15:17:17Z2020-06-05T15:17:17ZFix an off-by-one in printexpand() by leotHiltjo Posthumahiltjo@codemadness.orgcommit d4929d30226753c84a2ab5d8b703dd7df69eb4c1
parent 57259fbc5868fc4c8225640aaca8fd496dd6fc6c
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 5 Jun 2020 17:17:17 +0200
Fix an off-by-one in printexpand() by leot
Thanks leot!
57259fbc5868fc4c8225640aaca8fd496dd6fc6c2020-06-05T15:16:14Z2020-06-05T15:16:14Zsimplify parsejson error checkingHiltjo Posthumahiltjo@codemadness.orgcommit 57259fbc5868fc4c8225640aaca8fd496dd6fc6c
parent e386c4bbe4e59ce447c921b9063318e98318c0f1
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 5 Jun 2020 17:16:14 +0200
simplify parsejson error checking
e386c4bbe4e59ce447c921b9063318e98318c0f12020-06-05T15:15:34Z2020-06-05T15:15:34Zreplace newline to space instead of removing themHiltjo Posthumahiltjo@codemadness.orgcommit e386c4bbe4e59ce447c921b9063318e98318c0f1
parent c3e76b0f57c58b284cd13ce008c082525c8ee28a
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 5 Jun 2020 17:15:34 +0200
replace newline to space instead of removing them
c3e76b0f57c58b284cd13ce008c082525c8ee28a2020-06-05T12:51:58Z2020-06-05T12:51:58Zwork-in-progress: support the new Twitter siteHiltjo Posthumahiltjo@codemadness.orgcommit c3e76b0f57c58b284cd13ce008c082525c8ee28a
parent 663dab7d9883a291ed570a743fb89a16e1a01d85
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 5 Jun 2020 14:51:58 +0200
work-in-progress: support the new Twitter site
Scraping doesn't work anymore. Use the Twitter JSON API.
Major thanks to leot for helping with this.
663dab7d9883a291ed570a743fb89a16e1a01d852020-06-01T10:17:51Z2020-06-01T10:17:51Zfix typoHiltjo Posthumahiltjo@codemadness.orgcommit 663dab7d9883a291ed570a743fb89a16e1a01d85
parent 4eb084c604ac84247e704bd05a4737c8679461e2
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 1 Jun 2020 12:17:51 +0200
fix typo
4eb084c604ac84247e704bd05a4737c8679461e22020-05-04T11:37:42Z2020-05-04T11:37:42Zbump version to 0.5Hiltjo Posthumahiltjo@codemadness.orgcommit 4eb084c604ac84247e704bd05a4737c8679461e2
parent b83b7243ecfafc432cf1807f69c9ab30068fbd1f
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 4 May 2020 13:37:42 +0200
bump version to 0.5
b83b7243ecfafc432cf1807f69c9ab30068fbd1f2020-05-02T13:15:50Z2020-05-02T15:00:04Zstyle.css: improve horizontal scrolling for long titles/small windowsHiltjo Posthumahiltjo@codemadness.orgcommit b83b7243ecfafc432cf1807f69c9ab30068fbd1f
parent 8e11f80c105f02b55641d9068d03448a6ffc1ea3
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 2 May 2020 15:15:50 +0200
style.css: improve horizontal scrolling for long titles/small windows
8e11f80c105f02b55641d9068d03448a6ffc1ea32020-03-20T18:09:49Z2020-03-20T18:09:49Zutil: printescape(): it's enough to set a flag, no counter neededHiltjo Posthumahiltjo@codemadness.orgcommit 8e11f80c105f02b55641d9068d03448a6ffc1ea3
parent 9a0c90f26b3eef023910a699c7385b709a01ce90
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 19:09:49 +0100
util: printescape(): it's enough to set a flag, no counter needed
9a0c90f26b3eef023910a699c7385b709a01ce902020-03-20T17:37:11Z2020-03-20T17:54:21Zdocumentation: no need to explicitly set HTTP/1Hiltjo Posthumahiltjo@codemadness.orgcommit 9a0c90f26b3eef023910a699c7385b709a01ce90
parent e03dec6b97f74451786b3698c9fab1d0d45b9cd9
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 18:37:11 +0100
documentation: no need to explicitly set HTTP/1
(HTTP/2+ still sucks though)
e03dec6b97f74451786b3698c9fab1d0d45b9cd92020-03-20T17:18:17Z2020-03-20T17:54:21Ztscrape_update: tiny style fix in commentHiltjo Posthumahiltjo@codemadness.orgcommit e03dec6b97f74451786b3698c9fab1d0d45b9cd9
parent c427d54a874baa5c3f0cd0d725b90efa27cf34d2
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 18:18:17 +0100
tscrape_update: tiny style fix in comment
c427d54a874baa5c3f0cd0d725b90efa27cf34d22020-03-20T17:15:12Z2020-03-20T17:54:21Ztscrape_html: use a <pre> section per feedHiltjo Posthumahiltjo@codemadness.orgcommit c427d54a874baa5c3f0cd0d725b90efa27cf34d2
parent b0413f42bd2bc31cbbb5e338093de51b94cfd028
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 18:15:12 +0100
tscrape_html: use a <pre> section per feed
This improves output with Dillo, w3m and possibly other simple browsers.
- Dillo has a bug where it resets its block-style after <h2> when it is inside
<pre>. Then it ignores newlines (inside <pre>) and the links are inlined.
- w3m does not have a (line)margin for h2.
b0413f42bd2bc31cbbb5e338093de51b94cfd0282020-03-20T11:03:26Z2020-03-20T17:53:56Zman page improvements (sync)Hiltjo Posthumahiltjo@codemadness.orgcommit b0413f42bd2bc31cbbb5e338093de51b94cfd028
parent 423d3f5ad6023be3eb50ebe2f9504309bfe3d940
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 12:03:26 +0100
man page improvements (sync)
- tscraperc.5: use the same order as executed in the tscrape_update file.
- tscraperc.5: reference curl, which are optional, but used by default.
- tscraperc.5: use a .Sh VARIABLES section for tscrapepath and maxjobs.
- tscrape_update.1: split config format-specific documentation and reference it.
- just use the term "url" instead of "uri".
- shorten some texts, increasing readability.
- document exit status of tools.
fix:
- do not reference RSS/Atom.
423d3f5ad6023be3eb50ebe2f9504309bfe3d9402020-03-20T11:02:41Z2020-03-20T11:02:41Ztscrape_html/tscrape_plain: like sfeed, skip when a timestamp is empty/invalidHiltjo Posthumahiltjo@codemadness.orgcommit 423d3f5ad6023be3eb50ebe2f9504309bfe3d940
parent d34cbfb8d0bc3d985ce58ebfa3034115a0ef323d
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 12:02:41 +0100
tscrape_html/tscrape_plain: like sfeed, skip when a timestamp is empty/invalid
d34cbfb8d0bc3d985ce58ebfa3034115a0ef323d2020-03-20T11:01:35Z2020-03-20T11:01:35ZREADME: don't set LC_ALL for awk exampleHiltjo Posthumahiltjo@codemadness.orgcommit d34cbfb8d0bc3d985ce58ebfa3034115a0ef323d
parent 573905aec2e99fbe31a1cabe5864853ef9015a41
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 12:01:35 +0100
README: don't set LC_ALL for awk example
573905aec2e99fbe31a1cabe5864853ef9015a412020-03-20T11:00:16Z2020-03-20T11:00:16Zsync printutf8pad from sfeedHiltjo Posthumahiltjo@codemadness.orgcommit 573905aec2e99fbe31a1cabe5864853ef9015a41
parent 426522824e719e081c9c5e47ba8771779b0fdc85
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 20 Mar 2020 12:00:16 +0100
sync printutf8pad from sfeed
changes:
- util: printutf8pad: proper counting of multiwidth characters
for example the string "\xef\xbc\xb5".
- optimization
426522824e719e081c9c5e47ba8771779b0fdc852020-03-19T22:52:19Z2020-03-20T08:39:58Zignore possible consecutive `js-stream-item' when parsing a single tweetLeonardo Taccariiamleot@gmail.comcommit 426522824e719e081c9c5e47ba8771779b0fdc85
parent 6654f1b01d68e2b2ff7aa660cd678c1cba4d062f
Author: Leonardo Taccari <iamleot@gmail.com>
Date: Thu, 19 Mar 2020 23:52:19 +0100
ignore possible consecutive `js-stream-item' when parsing a single tweet
6654f1b01d68e2b2ff7aa660cd678c1cba4d062f2020-02-23T19:36:28Z2020-02-23T19:36:28Zbump version to 0.4Hiltjo Posthumahiltjo@codemadness.orgcommit 6654f1b01d68e2b2ff7aa660cd678c1cba4d062f
parent e22be126773aae620aadb29b9757824dc1060868
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 23 Feb 2020 20:36:28 +0100
bump version to 0.4
e22be126773aae620aadb29b9757824dc10608682020-02-23T19:18:01Z2020-02-23T19:18:01Ztscrape_update: don't preserve permissions of tmp files by moving, so copyHiltjo Posthumahiltjo@codemadness.orgcommit e22be126773aae620aadb29b9757824dc1060868
parent 1b03719564afdb0ba61fa1599d0cb796e10d6ed9
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 23 Feb 2020 20:18:01 +0100
tscrape_update: don't preserve permissions of tmp files by moving, so copy
noticed on DragonFlyBSD where it prints a warning when moving the file from
/tmp.
To reproduce it:
touch /tmp/file
mv /tmp/file ~/
On other systems this would not print a warning, but it would preserve the
group permissions etc.
1b03719564afdb0ba61fa1599d0cb796e10d6ed92020-02-01T14:56:04Z2020-02-01T14:56:04Zcleanup some more includesHiltjo Posthumahiltjo@codemadness.orgcommit 1b03719564afdb0ba61fa1599d0cb796e10d6ed9
parent 07ed8752bbdf330e170b53dc29f35044cbbbe958
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:56:04 +0100
cleanup some more includes
07ed8752bbdf330e170b53dc29f35044cbbbe9582020-02-01T14:30:37Z2020-02-01T14:30:37Zrm example.shHiltjo Posthumahiltjo@codemadness.orgcommit 07ed8752bbdf330e170b53dc29f35044cbbbe958
parent 64d62523dd2ee855258cef621a0942875910e416
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:30:37 +0100
rm example.sh
the field order was changed and tscrape_plain is a better use now.
64d62523dd2ee855258cef621a0942875910e4162020-02-01T14:13:52Z2020-02-01T14:13:52Zbump LICENSE to 2020Hiltjo Posthumahiltjo@codemadness.orgcommit 64d62523dd2ee855258cef621a0942875910e416
parent 0e4b407d02da9f565a5ba6def26db22565925cbd
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:13:52 +0100
bump LICENSE to 2020
0e4b407d02da9f565a5ba6def26db22565925cbd2020-02-01T14:13:33Z2020-02-01T14:13:33Zcleanup includes and remove unused fields from struct feedHiltjo Posthumahiltjo@codemadness.orgcommit 0e4b407d02da9f565a5ba6def26db22565925cbd
parent c0ae8d7b1f540d5cda35df1abe323aecf8ab3ff5
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:13:33 +0100
cleanup includes and remove unused fields from struct feed
c0ae8d7b1f540d5cda35df1abe323aecf8ab3ff52020-02-01T14:09:38Z2020-02-01T14:09:38Zstyle.css: sort propertiesHiltjo Posthumahiltjo@codemadness.orgcommit c0ae8d7b1f540d5cda35df1abe323aecf8ab3ff5
parent 8726a24365b8f746bcbb48a907a90afc8031b2f2
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:09:38 +0100
style.css: sort properties
8726a24365b8f746bcbb48a907a90afc8031b2f22020-02-01T14:05:02Z2020-02-01T14:05:02ZREADME: small changeHiltjo Posthumahiltjo@codemadness.orgcommit 8726a24365b8f746bcbb48a907a90afc8031b2f2
parent e24125d0f7679e0612c7ed69d79f07654f10637d
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:05:02 +0100
README: small change
e24125d0f7679e0612c7ed69d79f07654f10637d2020-02-01T14:03:00Z2020-02-01T14:03:00ZMakefile: simplify Makefile, rm config.mk, use system cflags/ldflagsHiltjo Posthumahiltjo@codemadness.orgcommit e24125d0f7679e0612c7ed69d79f07654f10637d
parent 5df58d27f557292778cdc5dee306f18db8c980f7
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:03:00 +0100
Makefile: simplify Makefile, rm config.mk, use system cflags/ldflags
5df58d27f557292778cdc5dee306f18db8c980f72020-02-01T14:02:27Z2020-02-01T14:02:27Zsync XML improvementsHiltjo Posthumahiltjo@codemadness.orgcommit 5df58d27f557292778cdc5dee306f18db8c980f7
parent f8629e681a16fc3af086355a44c942df57291b4b
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 1 Feb 2020 15:02:27 +0100
sync XML improvements
f8629e681a16fc3af086355a44c942df57291b4b2019-08-18T15:06:09Z2019-08-18T15:06:09Zbump version to 0.3Hiltjo Posthumahiltjo@codemadness.orgcommit f8629e681a16fc3af086355a44c942df57291b4b
parent 2bc2be55ea8cc026fc822ee8031dfd49bb77d7cc
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 18 Aug 2019 17:06:09 +0200
bump version to 0.3
2bc2be55ea8cc026fc822ee8031dfd49bb77d7cc2019-08-18T12:01:29Z2019-08-18T12:01:29Zman pages: fix mandoc lint warnings + wording tweaksHiltjo Posthumahiltjo@codemadness.orgcommit 2bc2be55ea8cc026fc822ee8031dfd49bb77d7cc
parent f05f3eb6c90f7b1baf7369498609dc5d5d212b63
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 18 Aug 2019 14:01:29 +0200
man pages: fix mandoc lint warnings + wording tweaks
f05f3eb6c90f7b1baf7369498609dc5d5d212b632019-08-17T10:10:00Z2019-08-17T10:10:00Zman page documentation for all tools: copied from sfeed and changedHiltjo Posthumahiltjo@codemadness.orgcommit f05f3eb6c90f7b1baf7369498609dc5d5d212b63
parent 797f715398aeac08febc8067ed6423da727e4f45
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 17 Aug 2019 12:10:00 +0200
man page documentation for all tools: copied from sfeed and changed
initial version, needs some more work.
797f715398aeac08febc8067ed6423da727e4f452019-08-02T16:46:06Z2019-08-02T16:46:06Zxml: improve cdata and comment callback logicHiltjo Posthumahiltjo@codemadness.orgcommit 797f715398aeac08febc8067ed6423da727e4f45
parent 51995d6fc4760fadac68650bb82773b9bf9eae79
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 2 Aug 2019 18:46:06 +0200
xml: improve cdata and comment callback logic
it used to call both handlers twice at the end for "-->" (comment) or "]]>"
(CDATA) with the data "" and length 0.
Now it is only called when non-empty. The start and end handlers can still be
used.
51995d6fc4760fadac68650bb82773b9bf9eae792019-08-02T16:33:10Z2019-08-02T16:33:10Ztscrape_update: sync improvements from sfeed_updateHiltjo Posthumahiltjo@codemadness.orgcommit 51995d6fc4760fadac68650bb82773b9bf9eae79
parent db47c97bea3370886d011a2c950ead2551cf3fbc
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 2 Aug 2019 18:33:10 +0200
tscrape_update: sync improvements from sfeed_update
- change order of functions in script and documentation to match the execution
order.
- improve a comment about the parallel processing behaviour (performance stall).
db47c97bea3370886d011a2c950ead2551cf3fbc2019-05-12T17:20:49Z2019-05-12T17:20:49Ztscrape_update improvementsHiltjo Posthumahiltjo@codemadness.orgcommit db47c97bea3370886d011a2c950ead2551cf3fbc
parent 5e6e62cf3522747a7c4573736d774503ff139a12
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 12 May 2019 19:20:49 +0200
tscrape_update improvements
- Better checking and verbose logging (on failure) of each stage:
fetchfeed, filter, merge, order, convertencoding. This makes sure on out-of-memory,
disk-space or other resource limits the output is not corrupted.
- This also has the added advantage it runs less processes (piped) at the same
time.
- Clear previous unneeded file to preserve space in /tmp
(/tmp is often mounted as mfs/tmpfs).
- Rename fetchfeed to fetch.
- Add logging function (able to override), use more logical logging format (pun
intended).
- Code-style: order overridable functions in execution order.
5e6e62cf3522747a7c4573736d774503ff139a122019-05-12T17:19:37Z2019-05-12T17:19:37ZREADME: add preface text, list dependenciesHiltjo Posthumahiltjo@codemadness.orgcommit 5e6e62cf3522747a7c4573736d774503ff139a12
parent 2f683439be0f889e05e42965926d44a7332f042d
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 12 May 2019 19:19:37 +0200
README: add preface text, list dependencies
2f683439be0f889e05e42965926d44a7332f042d2019-05-12T17:19:29Z2019-05-12T17:19:29Zexample: fix typo in nameHiltjo Posthumahiltjo@codemadness.orgcommit 2f683439be0f889e05e42965926d44a7332f042d
parent fb64d1d7eb24caab8ca7fb574ffad5886ff8f05f
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 12 May 2019 19:19:29 +0200
example: fix typo in name
fb64d1d7eb24caab8ca7fb574ffad5886ff8f05f2019-05-12T16:58:37Z2019-05-12T16:58:37Ztscrape_update: disable If-Modified-Since by defaultHiltjo Posthumahiltjo@codemadness.orgcommit fb64d1d7eb24caab8ca7fb574ffad5886ff8f05f
parent 0adbb9ec4d4f8e92643b1dfc737c1780a1c4c7a3
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 12 May 2019 18:58:37 +0200
tscrape_update: disable If-Modified-Since by default
0adbb9ec4d4f8e92643b1dfc737c1780a1c4c7a32019-04-23T19:07:24Z2019-04-23T19:07:24Zbump version to 0.2Hiltjo Posthumahiltjo@codemadness.orgcommit 0adbb9ec4d4f8e92643b1dfc737c1780a1c4c7a3
parent 33e40c9fc70a418c65d6de3cc68c1c3431f4cd47
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Tue, 23 Apr 2019 21:07:24 +0200
bump version to 0.2
33e40c9fc70a418c65d6de3cc68c1c3431f4cd472019-04-23T19:06:48Z2019-04-23T19:06:48ZMakefile: make it simpler to not compile compat objectsHiltjo Posthumahiltjo@codemadness.orgcommit 33e40c9fc70a418c65d6de3cc68c1c3431f4cd47
parent f5f136fabf5f8cd7245d963e7e479442200738e7
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Tue, 23 Apr 2019 21:06:48 +0200
Makefile: make it simpler to not compile compat objects
on OpenBSD: make COMPATOBJ=
f5f136fabf5f8cd7245d963e7e479442200738e72019-04-23T19:06:31Z2019-04-23T19:06:31Zutil.h: remove unused macro ISUTF8Hiltjo Posthumahiltjo@codemadness.orgcommit f5f136fabf5f8cd7245d963e7e479442200738e7
parent 8af4c93ad7dee6454abd9da106b9f3e37e1ce400
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Tue, 23 Apr 2019 21:06:31 +0200
util.h: remove unused macro ISUTF8
8af4c93ad7dee6454abd9da106b9f3e37e1ce4002019-04-22T12:46:41Z2019-04-22T12:46:41ZREADME: update, add twitter to Atom exampleHiltjo Posthumahiltjo@codemadness.orgcommit 8af4c93ad7dee6454abd9da106b9f3e37e1ce400
parent 2872a29d4f44afbfa4f439ba1f3d84c22114b0d4
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 22 Apr 2019 14:46:41 +0200
README: update, add twitter to Atom example
2872a29d4f44afbfa4f439ba1f3d84c22114b0d42019-04-22T12:46:30Z2019-04-22T12:46:30Zsync XML improvementsHiltjo Posthumahiltjo@codemadness.orgcommit 2872a29d4f44afbfa4f439ba1f3d84c22114b0d4
parent bd299de160e8f56d6f88538d9d4d4ded4775038d
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 22 Apr 2019 14:46:30 +0200
sync XML improvements
bd299de160e8f56d6f88538d9d4d4ded4775038d2019-02-22T12:11:48Z2019-02-22T12:11:48ZMakefile / make dist: there is no TODO fileHiltjo Posthumahiltjo@codemadness.orgcommit bd299de160e8f56d6f88538d9d4d4ded4775038d
parent 0743e76986d211f2a2695410c7678275355341d0
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 22 Feb 2019 13:11:48 +0100
Makefile / make dist: there is no TODO file
0743e76986d211f2a2695410c7678275355341d02019-02-22T12:09:12Z2019-02-22T12:09:12ZMakefile: add DOCPREFIX for installing docs in ports, use system {C,LD}FLAGSHiltjo Posthumahiltjo@codemadness.orgcommit 0743e76986d211f2a2695410c7678275355341d0
parent 1e8d328abcab552a8a58fee93eadada8b42148f7
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 22 Feb 2019 13:09:12 +0100
Makefile: add DOCPREFIX for installing docs in ports, use system {C,LD}FLAGS
change installed doc from /usr/local/share/tscrape to /usr/local/share/doc/tscrape
1e8d328abcab552a8a58fee93eadada8b42148f72019-02-22T12:02:22Z2019-02-22T12:02:22Zxml: remove unnecesary checksHiltjo Posthumahiltjo@codemadness.orgcommit 1e8d328abcab552a8a58fee93eadada8b42148f7
parent b11135a350c346d434708139583668e95b96427f
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 22 Feb 2019 13:02:22 +0100
xml: remove unnecesary checks
- reduce amount of data to check.
- remove unnecesary checks from (now) internal functions.
b11135a350c346d434708139583668e95b96427f2018-12-18T17:10:33Z2018-12-18T17:10:33Zrename getchar_ignore to getnext_ignoreHiltjo Posthumahiltjo@codemadness.orgcommit b11135a350c346d434708139583668e95b96427f
parent 36af5420b34500f32a5a8d3ee6601e57f3619bf8
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Tue, 18 Dec 2018 18:10:33 +0100
rename getchar_ignore to getnext_ignore
36af5420b34500f32a5a8d3ee6601e57f3619bf82018-12-17T17:55:32Z2018-12-17T17:55:32Ztscrape.1: formatted timestamp field was removed a long time agoHiltjo Posthumahiltjo@codemadness.orgcommit 36af5420b34500f32a5a8d3ee6601e57f3619bf8
parent 0368be87df8da32cab27d2d5fab28c053d352016
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 17 Dec 2018 18:55:32 +0100
tscrape.1: formatted timestamp field was removed a long time ago
0368be87df8da32cab27d2d5fab28c053d3520162018-12-17T17:40:28Z2018-12-17T17:40:28Ztscrape_update: remove unused terminated() functionHiltjo Posthumahiltjo@codemadness.orgcommit 0368be87df8da32cab27d2d5fab28c053d352016
parent ed3a979265abe557e783ea22c6a09fb96241ff95
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 17 Dec 2018 18:40:28 +0100
tscrape_update: remove unused terminated() function
ed3a979265abe557e783ea22c6a09fb96241ff952018-12-17T17:32:50Z2018-12-17T17:32:50Zignore incorrect unescaped HTML in <style> or <script> in a better wayHiltjo Posthumahiltjo@codemadness.orgcommit ed3a979265abe557e783ea22c6a09fb96241ff95
parent 0fac9621c44b76c38d911438b1966d665e3b8134
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 17 Dec 2018 18:32:50 +0100
ignore incorrect unescaped HTML in <style> or <script> in a better way
0fac9621c44b76c38d911438b1966d665e3b81342018-12-17T17:25:08Z2018-12-17T17:25:08ZXML tag parse improvements for PI and end tagsHiltjo Posthumahiltjo@codemadness.orgcommit 0fac9621c44b76c38d911438b1966d665e3b8134
parent 24fad792de3bab17f1cf485450435761fb3b8657
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 17 Dec 2018 18:25:08 +0100
XML tag parse improvements for PI and end tags
- Stricter parsing of tags, no whitespace stripping after <.
- For end tags the "internal" context x->tag would be "/sometag". Make sure
this matches exactly with the parameter tag.
- Reset tagname after parsing an end tag.
- Make end tag handling more consistent.
- Remove temporary variable taglen.
24fad792de3bab17f1cf485450435761fb3b86572018-12-17T17:23:35Z2018-12-17T17:23:35ZMakefile: just use OpenBSD #ifdef for pledge(2)Hiltjo Posthumahiltjo@codemadness.orgcommit 24fad792de3bab17f1cf485450435761fb3b8657
parent e9035ce73e795646108130d6f0e3e4f3be30e46a
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 17 Dec 2018 18:23:35 +0100
Makefile: just use OpenBSD #ifdef for pledge(2)
e9035ce73e795646108130d6f0e3e4f3be30e46a2018-12-17T17:16:08Z2018-12-17T17:21:49Ztscrape_update: Sync with sfeed_updateLeonardo Taccariiamleot@gmail.comcommit e9035ce73e795646108130d6f0e3e4f3be30e46a
parent 8ff19bed65e82fddf4c01543eaa536863b378fc2
Author: Leonardo Taccari <iamleot@gmail.com>
Date: Mon, 17 Dec 2018 18:16:08 +0100
tscrape_update: Sync with sfeed_update
- Handle signals consistently in different shells
- Improve SIGINT handling
- Add a variable for max amount of feeds to update concurrently
- Add filter(), order() support per feed
- Don't always exit 1, exit 130 on SIGINT, exit 0 otherwise
- Fail on feed HTTP redirect
--http1.0 curl option was not removed (it is not present in sfeed_update)
to avoid HTTP/2.
8ff19bed65e82fddf4c01543eaa536863b378fc22018-09-07T17:36:50Z2018-09-07T17:36:50Zutil: fix UB with ctype functionsHiltjo Posthumahiltjo@codemadness.orgcommit 8ff19bed65e82fddf4c01543eaa536863b378fc2
parent 1bd861708376c35e3b43e9e0720ff5248d3050d3
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 7 Sep 2018 19:36:50 +0200
util: fix UB with ctype functions
1bd861708376c35e3b43e9e0720ff5248d3050d32018-09-07T17:35:38Z2018-09-07T17:35:38Zfix UB with ctype functionsHiltjo Posthumahiltjo@codemadness.orgcommit 1bd861708376c35e3b43e9e0720ff5248d3050d3
parent aec6f674a5888e14a217ae1944ca2d3d0e790a6a
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 7 Sep 2018 19:35:38 +0200
fix UB with ctype functions
aec6f674a5888e14a217ae1944ca2d3d0e790a6a2018-08-26T13:24:23Z2018-08-26T13:24:23Zremove stdint.h header: not needed anymoreHiltjo Posthumahiltjo@codemadness.orgcommit aec6f674a5888e14a217ae1944ca2d3d0e790a6a
parent e484a6a4466f9d12d2d86d36e08f0e38d0dcc0e1
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 26 Aug 2018 15:24:23 +0200
remove stdint.h header: not needed anymore
e484a6a4466f9d12d2d86d36e08f0e38d0dcc0e12018-08-26T13:23:36Z2018-08-26T13:23:36Zxml: sync many XML parser improvementsHiltjo Posthumahiltjo@codemadness.orgcommit e484a6a4466f9d12d2d86d36e08f0e38d0dcc0e1
parent fc60e68a53c20270ac11a9f70be701eefb71ae19
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 26 Aug 2018 15:23:36 +0200
xml: sync many XML parser improvements
fc60e68a53c20270ac11a9f70be701eefb71ae192018-05-11T18:24:04Z2018-05-11T18:24:04Ztscrape_plain: include sys/types.hHiltjo Posthumahiltjo@codemadness.orgcommit fc60e68a53c20270ac11a9f70be701eefb71ae19
parent 8687ecf047ce644a17933cc4edf716bfb702ee83
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 11 May 2018 20:24:04 +0200
tscrape_plain: include sys/types.h
8687ecf047ce644a17933cc4edf716bfb702ee832018-05-11T18:20:50Z2018-05-11T18:20:50Zupdate tscrape.1: document formatHiltjo Posthumahiltjo@codemadness.orgcommit 8687ecf047ce644a17933cc4edf716bfb702ee83
parent b05127c4659da5b6d49e553877620008ea5732aa
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 11 May 2018 20:20:50 +0200
update tscrape.1: document format
b05127c4659da5b6d49e553877620008ea5732aa2018-05-11T18:20:36Z2018-05-11T18:20:36Zimprove Makefile, install tscrape.1Hiltjo Posthumahiltjo@codemadness.orgcommit b05127c4659da5b6d49e553877620008ea5732aa
parent d8794a48600762f8dff25c97d1b26c650a83f6b2
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 11 May 2018 20:20:36 +0200
improve Makefile, install tscrape.1
d8794a48600762f8dff25c97d1b26c650a83f6b22018-03-30T12:34:07Z2018-03-30T12:34:07Zbump LICENSEHiltjo Posthumahiltjo@codemadness.orgcommit d8794a48600762f8dff25c97d1b26c650a83f6b2
parent 203721702e9cac7b47cbeeb91011f232bb26b7bc
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 30 Mar 2018 14:34:07 +0200
bump LICENSE
203721702e9cac7b47cbeeb91011f232bb26b7bc2018-03-11T23:45:22Z2018-03-11T23:45:22Zremove unused variableHiltjo Posthumahiltjo@codemadness.orgcommit 203721702e9cac7b47cbeeb91011f232bb26b7bc
parent 227743f84d79e15f67b761e2d92e20dbc7083d81
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Mon, 12 Mar 2018 00:45:22 +0100
remove unused variable
227743f84d79e15f67b761e2d92e20dbc7083d812018-03-11T17:45:33Z2018-03-11T17:45:33Zsync xml improvementsHiltjo Posthumahiltjo@codemadness.orgcommit 227743f84d79e15f67b761e2d92e20dbc7083d81
parent 7789dc04f4937dd68677a953320537b3da519f3b
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 11 Mar 2018 18:45:33 +0100
sync xml improvements
... better CDATA and comment parsing, etc
7789dc04f4937dd68677a953320537b3da519f3b2017-08-26T13:36:10Z2017-08-26T13:36:10Zimprove and simplify ignore tag handlingHiltjo Posthumahiltjo@codemadness.orgcommit 7789dc04f4937dd68677a953320537b3da519f3b
parent e3bd0af8ac5af175c7dee7c24eadf238f5f4334f
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 26 Aug 2017 15:36:10 +0200
improve and simplify ignore tag handling
e3bd0af8ac5af175c7dee7c24eadf238f5f4334f2017-08-26T11:43:22Z2017-08-26T11:43:22Zxml: simplify a bitHiltjo Posthumahiltjo@codemadness.orgcommit e3bd0af8ac5af175c7dee7c24eadf238f5f4334f
parent cb8ed18e7f5f31e68c9d5ab11a6daa8677af6636
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 26 Aug 2017 13:43:22 +0200
xml: simplify a bit
cb8ed18e7f5f31e68c9d5ab11a6daa8677af66362017-08-26T10:43:15Z2017-08-26T10:43:15Zsimplify ignore tags parsingHiltjo Posthumahiltjo@codemadness.orgcommit cb8ed18e7f5f31e68c9d5ab11a6daa8677af6636
parent 2dc167003132b6d9db8e779f26681c560c07a119
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 26 Aug 2017 12:43:15 +0200
simplify ignore tags parsing
2dc167003132b6d9db8e779f26681c560c07a1192017-08-25T15:51:12Z2017-08-25T15:51:12Zwhen ignoring then ignore all attribute parsing aswellHiltjo Posthumahiltjo@codemadness.orgcommit 2dc167003132b6d9db8e779f26681c560c07a119
parent 1ff56f1ce94cd62b0c16ee343917435c9048b8b8
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 25 Aug 2017 17:51:12 +0200
when ignoring then ignore all attribute parsing aswell
1ff56f1ce94cd62b0c16ee343917435c9048b8b82017-08-25T15:44:37Z2017-08-25T15:44:37Zinitial support to ignore literals in <script> and <style>Hiltjo Posthumahiltjo@codemadness.orgcommit 1ff56f1ce94cd62b0c16ee343917435c9048b8b8
parent 006a11c3aced38fa2cc3915793c1b9e886d0ad41
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 25 Aug 2017 17:44:37 +0200
initial support to ignore literals in <script> and <style>
006a11c3aced38fa2cc3915793c1b9e886d0ad412017-08-25T15:39:02Z2017-08-25T15:39:02Zv -> c for classname shorthandHiltjo Posthumahiltjo@codemadness.orgcommit 006a11c3aced38fa2cc3915793c1b9e886d0ad41
parent c94c9c1f8d1ac27670bbd0faf5481e544af50e25
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 25 Aug 2017 17:39:02 +0200
v -> c for classname shorthand
c94c9c1f8d1ac27670bbd0faf5481e544af50e252017-08-25T15:36:14Z2017-08-25T15:36:14Zinitial video extract supportHiltjo Posthumahiltjo@codemadness.orgcommit c94c9c1f8d1ac27670bbd0faf5481e544af50e25
parent 30a8b7252f74e21ba76ca08d00ffc294abad302f
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 25 Aug 2017 17:36:14 +0200
initial video extract support
30a8b7252f74e21ba76ca08d00ffc294abad302f2017-08-25T14:37:55Z2017-08-25T14:37:55Zfix data-image-urlHiltjo Posthumahiltjo@codemadness.orgcommit 30a8b7252f74e21ba76ca08d00ffc294abad302f
parent 74421a0dd39d2f3cd496ab9e9efcf38e2fef594e
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 25 Aug 2017 16:37:55 +0200
fix data-image-url
74421a0dd39d2f3cd496ab9e9efcf38e2fef594e2017-08-13T09:17:50Z2017-08-13T09:17:50Zcleanup: remove baseurl and encoding and some leftovers from sfeedHiltjo Posthumahiltjo@codemadness.orgcommit 74421a0dd39d2f3cd496ab9e9efcf38e2fef594e
parent 3256e00c6408e55e17d9de130d677392acf78177
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 13 Aug 2017 11:17:50 +0200
cleanup: remove baseurl and encoding and some leftovers from sfeed
3256e00c6408e55e17d9de130d677392acf781772017-08-13T09:04:01Z2017-08-13T09:04:01Ztscrape_html: add style.cssHiltjo Posthumahiltjo@codemadness.orgcommit 3256e00c6408e55e17d9de130d677392acf78177
parent b8b076dffc9fa2002ffb380b7b1e7d184189f539
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 13 Aug 2017 11:04:01 +0200
tscrape_html: add style.css
b8b076dffc9fa2002ffb380b7b1e7d184189f5392017-08-13T08:26:47Z2017-08-13T08:26:47Zadd tscraperc example fileHiltjo Posthumahiltjo@codemadness.orgcommit b8b076dffc9fa2002ffb380b7b1e7d184189f539
parent 155b8a4fb6cbfe358721d3604bcd4526993f7897
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sun, 13 Aug 2017 10:26:47 +0200
add tscraperc example file
155b8a4fb6cbfe358721d3604bcd4526993f78972017-08-12T15:47:20Z2017-08-12T15:47:20Zadd tscrape_update, tscrape_html format program, update MakefileHiltjo Posthumahiltjo@codemadness.orgcommit 155b8a4fb6cbfe358721d3604bcd4526993f7897
parent 7bdeb05e31e28c4cfaf385dffa48ea80aa476315
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 12 Aug 2017 17:47:20 +0200
add tscrape_update, tscrape_html format program, update Makefile
similar to sfeed
7bdeb05e31e28c4cfaf385dffa48ea80aa4763152017-08-12T15:23:35Z2017-08-12T15:23:35Zfirst version of tscrape_htmlHiltjo Posthumahiltjo@codemadness.orgcommit 7bdeb05e31e28c4cfaf385dffa48ea80aa476315
parent 4640420521e94158d80f94202ed40f7dc4a66169
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 12 Aug 2017 17:23:35 +0200
first version of tscrape_html
4640420521e94158d80f94202ed40f7dc4a661692017-08-12T15:15:41Z2017-08-12T15:15:41Zparse own username and fullname from data, add item username and fullnameHiltjo Posthumahiltjo@codemadness.orgcommit 4640420521e94158d80f94202ed40f7dc4a66169
parent f712b91a8db0fb66f7facf349ea859da07717dc7
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 12 Aug 2017 17:15:41 +0200
parse own username and fullname from data, add item username and fullname
f712b91a8db0fb66f7facf349ea859da07717dc72017-08-12T10:52:23Z2017-08-12T10:52:23Zseparate parsing and formatting like sfeedHiltjo Posthumahiltjo@codemadness.orgcommit f712b91a8db0fb66f7facf349ea859da07717dc7
parent f0b8be83a871c59f1bd9a99f16bf20ce9df57c22
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 12 Aug 2017 12:52:23 +0200
separate parsing and formatting like sfeed
- remove formatted timestamp field.
- add tscrape_plain
f0b8be83a871c59f1bd9a99f16bf20ce9df57c222017-08-12T10:51:30Z2017-08-12T10:51:30Zstore retweet id instead of 0 or 1Hiltjo Posthumahiltjo@codemadness.orgcommit f0b8be83a871c59f1bd9a99f16bf20ce9df57c22
parent 7a70a71cc130ad9b1e0d2f95dbf5a4eb591f55c1
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 12 Aug 2017 12:51:30 +0200
store retweet id instead of 0 or 1
7a70a71cc130ad9b1e0d2f95dbf5a4eb591f55c12017-08-12T10:28:01Z2017-08-12T10:28:01Zonly print tweet if it has a text and usernameHiltjo Posthumahiltjo@codemadness.orgcommit 7a70a71cc130ad9b1e0d2f95dbf5a4eb591f55c1
parent 21f2194359d8f863c8c3a3a77c48b2507c03f924
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 12 Aug 2017 12:28:01 +0200
only print tweet if it has a text and username
21f2194359d8f863c8c3a3a77c48b2507c03f9242017-08-12T10:27:44Z2017-08-12T10:27:44Zreset classname in xmltagend is not neededHiltjo Posthumahiltjo@codemadness.orgcommit 21f2194359d8f863c8c3a3a77c48b2507c03f924
parent df78a8500f5b4c3d7aaf400ae33d88ec2468ab62
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Sat, 12 Aug 2017 12:27:44 +0200
reset classname in xmltagend is not needed
df78a8500f5b4c3d7aaf400ae33d88ec2468ab622017-08-11T17:29:30Z2017-08-11T17:29:30Zadd support of the message/status id and to see if pinned or retweetHiltjo Posthumahiltjo@codemadness.orgcommit df78a8500f5b4c3d7aaf400ae33d88ec2468ab62
parent d47213205f85aa1d34c2a1e2e414b6ff2452fc66
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 11 Aug 2017 19:29:30 +0200
add support of the message/status id and to see if pinned or retweet
d47213205f85aa1d34c2a1e2e414b6ff2452fc662017-08-11T17:29:01Z2017-08-11T17:29:01Zwith_replies link is not supported anymore I thinkHiltjo Posthumahiltjo@codemadness.orgcommit d47213205f85aa1d34c2a1e2e414b6ff2452fc66
parent e22ef54ff11eaa0c478591c1577c9e68ad335c75
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 11 Aug 2017 19:29:01 +0200
with_replies link is not supported anymore I think
e22ef54ff11eaa0c478591c1577c9e68ad335c752017-08-11T14:15:48Z2017-08-11T14:15:48Zparse classname better, hide u-hidden image links, but show direct image linksHiltjo Posthumahiltjo@codemadness.orgcommit e22ef54ff11eaa0c478591c1577c9e68ad335c75
parent b4bc9e6b47df5b9eb612f069c20463a924d6a55e
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 11 Aug 2017 16:15:48 +0200
parse classname better, hide u-hidden image links, but show direct image links
b4bc9e6b47df5b9eb612f069c20463a924d6a55e2017-08-11T13:46:57Z2017-08-11T13:46:57Zincrease buffer size, separate @username with spacesHiltjo Posthumahiltjo@codemadness.orgcommit b4bc9e6b47df5b9eb612f069c20463a924d6a55e
parent f92f2d068c213425f073d830a1dd8b86126168a5
Author: Hiltjo Posthuma <hiltjo@codemadness.org>
Date: Fri, 11 Aug 2017 15:46:57 +0200
increase buffer size, separate @username with spaces