Jan Lukas Gernert | 6a58e45c7a | add cnet test | 2023-03-10 07:05:10 +01:00
Jan Lukas Gernert | a915d8fe67 | update some older tests | 2023-03-10 06:36:21 +01:00
Jan Lukas Gernert | 7b6d22ebc8 | add cnet-svg-classes test | 2023-03-10 06:33:24 +01:00
Jan Lukas Gernert | 3ece2522bb | add clean links test | 2023-03-09 21:24:29 +01:00
Jan Lukas Gernert | c5c6b788c8 | add citilab test & fix noscript unwrapping | 2023-03-09 20:10:03 +01:00
Jan Lukas Gernert | 612f022879 | add buzzfeed test | 2023-03-06 01:36:37 +01:00
Jan Lukas Gernert | 45b4141049 | add new test | 2023-03-06 00:04:23 +01:00
Jan Lukas Gernert | 9c5ffda5de | add breitbart test | 2023-03-04 23:40:23 +01:00
Jan Lukas Gernert | e2b804d00a | add blogger test | 2023-03-04 17:41:22 +01:00
Jan Lukas Gernert | 6964724102 | add bbc test | 2023-03-02 01:09:44 +01:00
Jan Lukas Gernert | 4031750956 | tag cleaning test | 2023-03-01 01:37:44 +01:00
Jan Lukas Gernert | cea23f1638 | always use fakehost url for tests | 2023-03-01 00:46:35 +01:00
Jan Lukas Gernert | 80de6d177c | url completion test | 2023-03-01 00:42:44 +01:00
Jan Lukas Gernert | 451dd61547 | add two new tests | 2023-02-28 18:28:55 +01:00
Jan Lukas Gernert | aea57d0cf3 | fix has_single_tag_inside_element & update tests | 2023-02-28 03:59:48 +01:00
Jan Lukas Gernert | 31a8033844 | fixes, more sanitation & 1 more failing test | 2023-02-28 01:50:13 +01:00
Jan Lukas Gernert | df999cd9fc | more cleanups & more tests | 2023-02-27 01:00:56 +01:00
Jan Lukas Gernert | 0834c4d72a | fixes | 2023-02-26 02:22:53 +01:00
Jan Lukas Gernert | e3246af28b | refactor & more testing | 2023-02-25 00:42:26 +01:00
Jan Lukas Gernert | cce912c354 | first content extraction kinda working | 2023-02-20 00:29:44 +01:00
Jan Lukas Gernert | c1ae011fcd | use global rules | 2022-10-07 07:17:31 +02:00
Jan Lukas Gernert | 8f48b69161 | remove unneeded files | 2020-04-28 03:07:21 +02:00
Jan Lukas Gernert | a99b8dec47 | wip: test libxml XML_SAVE_NO_EMPTY option | 2019-09-24 18:45:06 +02:00
Jan Lukas Gernert | 4b2e6a24eb | initial commit | 2018-07-31 16:10:09 +02:00