Zenon Mousmoulas
9fe6f557eb
Merge 3872619ed5
into d55d1f423d
2024-12-16 14:56:48 +02:00
pukkandan
1d3751c3fe
Escape URLs in sanitized_Request
, not sanitize_url
d2558234cf5dd12d6896eed5427b7dcdb3ab7b5a added escaping of URLs while sanitizing. However, sanitize_url
may not always receive an actual URL. Eg: When using youtube-dl "search query" --default-search ytsearch
, search query
gets escaped to search%20query
before being prefixed with ytsearch:
which is not the intended behavior. So the escaping is moved to sanitized_Request
instead.
2023-02-20 20:27:25 +00:00
dirkf
195f22f679
[generic] Improve KVS (etc) extraction
2022-11-13 15:09:29 +00:00
dirkf
fc2beab0e7
[generic] Improve KVS (etc) extraction
...
* detect kt_player('kt_player', 'https://.../kt_player.swf?v=5 ...
* detect age limit if 18 USC 2257 is mentioned
* test with shooshtime.com
Partially resolves #31332 .
2022-11-13 14:59:30 +00:00
FraFraFra-LongD
1a4fbe8462
Added ThisVid.com support ( #29187 )
...
* add ThisVidIE, ThisVidMemberIE, ThisVidPlaylistIE
* redirect embed to main page for more metadata
* use KVS extraction newly added to GenericIE and remove duplicate tests
* also add MrDeepFake etc compat to GenericIE
(closes #22390 )
Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-11-13 13:22:04 +00:00
dirkf
c2f9be3e63
[generic] Add KVS player extraction
2022-11-12 11:55:05 +00:00
Zenon Mousmoulas
9af0f299bf
Add Ant1NewsGr IEs
...
* Add new IEs
* Ant1NewsGrBaseIE: Base IE class
* Ant1NewsGrWatchIE: Extract videos from TV VOD pages
* Ant1NewsGrArticleIE: Extract videos from news articles
* Ant1NewsGrEmbedIE: Extract iframe-embeddable ant1news.gr videos
* NB: There is a different platform at vod.antenna.gr, which is not
covered here
* The Generic extractor can also be used to extract videos from news
article pages (through Ant1NewsGrEmbed._extract_urls), however a
specific IE is used to extract JSON-LD @type='NewsArticle' metadata
* However that does not work currently, as _json_ld does not support
@graph nesting
* Ant1NewsGrArticleIE defers to Ant1NewsGrEmbedIE, either as a playlist
or a single video
* Ant1NewsGrWatchIE and Ant1NewsGrEmbedIE query an API endpoint to
extract metadata, get the respective stream/source URLs and detect
video formats
* The endpoint HTTP path varies per IE
* Ant1NewsGrEmbedIE first resolves any redirects for its' own URL, to
derive the properl base URL for the API query
2021-11-11 15:47:41 +02:00
Aleri Kaisattera
379f52a495
[liveleak] Remove extractor ( closes #17625 , closes #24222 ) ( #29331 )
2021-06-21 04:23:50 +07:00
Sergey M․
1e8aaa1d15
[generic] Add support for og:audio ( closes #28311 , closes #29015 )
2021-05-16 21:42:38 +07:00
Sergey M․
286e01ce30
[generic] Add support for sibnet embeds
2021-05-16 20:50:32 +07:00
Sergey M․
552b139911
[generic] Add Referer header for direct videojs download URLs ( closes #2879 , closes #20217 , closes #29053 )
2021-05-16 20:29:35 +07:00
The Hatsune Daishi
eafcadea26
[extractor] escape forgotten dot for hostnames in regular expression ( #28530 )
2021-03-24 14:33:19 +00:00
dmsummers
e20ec43094
[simplecast] Add new extractor( closes #24107 )
2021-02-17 14:53:23 +01:00
Remita Amine
c669554ef5
[medialaan] add support DPG Media MyChannels based websites
...
closes #14871
closes #15597
closes #16106
closes #16489
2021-01-27 14:58:47 +01:00
Remita Amine
ab89a8678b
[arcpublishing] Add new extractor
...
closes #2298
closes #9340
closes #17200
2020-12-31 23:31:07 +01:00
Remita Amine
0fa67c1d68
[generic] Add support for VHX Embeds( #27546 )
2020-12-27 09:00:07 +01:00
Sergey M․
af1312bfc3
[youtube:tab] Extend _VALID_URL ( closes #27501 )
2020-12-26 19:59:57 +07:00
Sergey M․
3729c52f9d
[generic] Improve RSS age limit extraction
2020-12-19 23:24:52 +07:00
renalid
71679eaee8
[generic] Fix RSS itunes thumbnail extraction ( #27405 )
2020-12-19 23:18:51 +07:00
Andrey Smirnoff
b69bb1ed11
[smotri] Remove extractor ( #27358 )
2020-12-10 02:01:23 +07:00
Remita Amine
3ded751985
[generic] comment a test covered now by AmericasTestKitchenIE
2020-12-07 15:27:13 +01:00
Sergey M․
791b743765
[extractor/generic] Remove unused import
2020-12-07 02:01:47 +07:00
Sergey M․
dccf4932e1
[extractor/common] Extract timestamp from Last-Modified header
2020-12-07 01:15:30 +07:00
Sergey M․
91dd25fe1e
[extractor/common] Add support for dl8-* media tags ( closes #27283 )
2020-12-07 01:08:22 +07:00
Sergey M․
f2c704e112
[generic] Extract RSS video itunes metadata
2020-12-06 23:08:03 +07:00
Sergey M․
5e822c2526
[generic] Extract RSS video timestamp
2020-12-06 22:30:30 +07:00
renalid
cc017e07ca
[generic] Extract RSS video description ( #27177 )
2020-12-06 22:12:25 +07:00
Josh Soref
71ddc222ad
Fix typos ( #27084 )
...
* spelling: authorization
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: brightcove
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: creation
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: exceeded
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: exception
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: extension
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: extracting
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: extraction
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: frontline
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: improve
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: length
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: listsubtitles
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: multimedia
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: obfuscated
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: partitioning
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: playlist
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: playlists
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: restriction
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: services
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: split
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: srmediathek
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: support
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: thumbnail
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: verification
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: whitespaces
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
2020-11-21 22:00:05 +07:00
Sergey M․
91e954587f
[arte] Rework extractors
...
* Reimplement embed and playlist extractors to delegate to the single entrypoint artetv extractor
Beware reluctant download archive extractor keys breakage.
* Improve embeds detection (closes #27057 )
- Remove obsolete code
2020-11-19 05:02:04 +07:00
Sergey M․
562de77f41
[kaltura] Add support for multiple embeds on a webpage ( closes #25523 )
2020-06-06 02:14:35 +07:00
Sergey M․
4e7b5bba5f
[mofosex] Add support for generic embeds ( closes #24633 )
2020-04-06 01:29:58 +07:00
Sergey M․
52c4c51556
[youporn] Add support form generic embeds
2020-04-05 20:56:14 +07:00
Sergey M․
8fae1a04eb
[spankwire] Add support for generic embeds (refs #24633 )
2020-04-05 20:42:56 +07:00
Sergey M․
be7dacf9cf
[generic] Look for teachable embeds before wistia
2020-03-24 02:57:38 +07:00
Sergey M․
7947a1f7db
Remove no longer needed compat_str around geturl
2020-02-29 19:19:24 +07:00
Sergey M․
fda6d237a5
[wistia] Add support for multiple generic embeds ( closes #8347 , closes #11385 )
2020-02-23 06:47:11 +07:00
Remita Amine
a9866c0366
[zype] improve extraction
...
- extract subtitles(closes #21258 )
- support URLs with alternative keys/tokens(#21258 )
- extract more metadata
2020-01-15 14:18:04 +01:00
Remita Amine
bd2c211fcc
[vodplatform] add support for embed.kwikmotion.com domain
2020-01-12 17:34:57 +01:00
Remita Amine
6471d0d3b8
[openload] remove OpenLoad related extractors( closes #11999 )( closes #15406 )
2019-11-26 23:57:37 +01:00
Remita Amine
55adb63e54
[kinja] add support for Kinja embeds
...
closes #5756
closes #11282
closes #22237
closes #22384
2019-11-06 19:56:10 +01:00
Remita Amine
548c395716
[soundcloud] improve extraction
...
- improve format extraction(closes #22123 )
- extract uploader_id and uploader_url(closes #21916 )
- extract all known thumbnails(closes #19071 )(closes #20659 )
- fix extration for private playlists(closes #20976 )
- add support for playlist embeds(#20976 )
- skip preview formats(closes #22806 )
2019-10-27 17:52:46 +01:00
Remita Amine
416c3ca7f5
[odnoklassniki] add support for Schemeless embed extraction
2019-10-25 19:27:28 +01:00
Remita Amine
755541a4c8
[mangomolo] fix video format extraction and add support for player URLs
2019-10-17 13:21:44 +01:00
Remita Amine
c317b6163b
[vessel] remove extractor
2019-10-10 00:01:51 +01:00
Sergey M․
7cb51b5daf
[extractor/generic] Improve squarespace detection and fix test ( closes #21859 , refs #21294 , refs #21802 )
2019-09-01 01:25:48 +07:00
Barbara Miller
d78657fd18
[extractor/generic] Add support for squarespace embeds ( closes #21294 )
2019-09-01 01:25:48 +07:00
Sergey M․
9c1da4a9f9
[extractor/generic] Restrict --default-search schemeless URLs detection pattern ( closes #21842 )
2019-07-20 23:08:26 +07:00
Remita Amine
5e3da0d42b
[dailymotion] add support embed with DM.player js call
2019-07-01 08:37:21 +01:00
ealgase
612300a686
[novamov] Remove extractors ( #21077 )
...
Sites no longer exist
2019-05-23 03:38:48 +07:00
Sergey M․
3089bc748c
Fix W504 and disable W503 ( closes #20863 )
2019-05-11 03:57:40 +07:00