NewPipeExtractor

Commit Graph

Author	SHA1	Message	Date
AudricV	f92426560c	Add descriptive audio properties Also improve AudioStream's audio language documentation	2023-02-26 19:06:16 +01:00
AudricV	a63f289667	[YouTube] Update mocks of YoutubeCommentsExtractorTest.FormattingTest	2023-02-26 18:50:07 +01:00
AudricV	9483dcd9fa	[YouTube] Update mocks of YoutubeCommentsExtractorTest.RepliesTest	2023-02-26 18:43:36 +01:00
AudricV	1556adbb2d	[YouTube] Fix hashtags links extraction and escape text in attribute descriptions + HTML links webCommandMetadata object is contained inside a commandMetadata one, so it is not accessible from the root of the navigationEndpoint object. The corresponding statement has been moved at the bottom of the specific endpoints parsing, as the webCommandMetadata object is present almost everywhere, otherwise URLs of some endpoints would have be changed, such as uploader URLs (from channel IDs to handles). As no ParsingException is now thrown by getUrlFromNavigationEndpoint, and so by getTextFromObject, getUrlFromObject and getTextAtKey, the methods which were catching ParsingExceptions thrown by these methods had to be updated. URLs got in the HTML version of getTextFromObject are now escaped properly to provide valid HTML to clients. This has been also done for attribute descriptions, with the description text for this type of descriptions. As YouTube descriptions are in HTML format (except for the fallback on the JSON player response, which is plain text and only happens when there is no visual metadata or a breaking change), all URLs returned are escaped, so tests which are testing presence of URLs with escaped characters had to be updated (it was only the case for YoutubeStreamExtractorDefaultTest.DescriptionTestUnboxing).	2023-02-26 18:43:36 +01:00
petlyh	f7a7a236fb	[Bandcamp] Show comments as disabled on radio streams	2023-02-23 18:42:43 +01:00
dependabot[bot]	f5599ff08d	Bump org.jsoup:jsoup from 1.15.3 to 1.15.4 Bumps [org.jsoup:jsoup](https://github.com/jhy/jsoup) from 1.15.3 to 1.15.4. - [Release notes](https://github.com/jhy/jsoup/releases) - [Changelog](https://github.com/jhy/jsoup/blob/master/CHANGES) - [Commits](https://github.com/jhy/jsoup/compare/jsoup-1.15.3...jsoup-1.15.4) --- updated-dependencies: - dependency-name: org.jsoup:jsoup dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2023-02-20 10:05:26 +00:00
TobiGr	3f7df9536e	[YouTube] Fix getting the comment text if the comment contains a hashtag	2023-01-29 20:33:51 +01:00
Stypox	999fb7f812	Merge pull request #1024 from AudricV/snd_fix-tracks-like-count [SoundCloud] Fix extraction of tracks like count	2023-01-29 10:52:54 +01:00
Stypox	3519d4c367	Merge pull request #1015 from AudricV/yt_fix-channel-id-rss-feeds [YouTube] Fix channel ID extraction of YouTube channels RSS feeds	2023-01-29 10:41:38 +01:00
Stypox	9aca710e86	Merge pull request #1013 from Stypox/fix-music-mixes [YouTube] Now music mixes can be treated as normal mixes	2023-01-29 09:48:51 +01:00
Stypox	76eeabac45	Merge pull request #1020 from TeamNewPipe/fix/yt-subscriber-count [YouTube] Fix NPE in search when getting channel items without subscriber count	2023-01-29 09:44:22 +01:00
AudricV	676622f6df	[SoundCloud] Fix expectedLikeCountAtLeast tests of SoundcloudStreamExtractorTest test classes As like count is now returned by the extractor, we need to assert a positive minimum like count, which is close to the actual value, in order to avoid test failures due to lower like counts than the ones excepted.	2023-01-29 01:08:02 +01:00
AudricV	2a24d407d5	[SoundCloud] Fix extraction of tracks like count SoundCloud is using likes_count to return the like count of a track, like it was the case before they switched to favoritings_count.	2023-01-29 01:00:49 +01:00
AudricV	ba24976e41	[YouTube] Add live URLs test and do minor improvements to YoutubeStreamLinkHandlerFactoryTest - Remove unused imports; - Replace wildcard imports by single class imports; - Suppress "HTTP links are not secured" warnings from IDEA IDEs; - Replace removed video jZViOEv90dI by an existing video, 9Dpqou5cI08 (the corresponding test has been of course renamed).	2023-01-28 19:36:21 +01:00
AudricV	57f850bc2d	[YouTube] Support live URLs and do minor improvements to YoutubeStreamLinkHandlerFactory - Move license header at the top; - Use an unmodifiable set for the subpaths instead of a modifiable list; - Add missing Nonnull and Nullable annotations; - Improve exception messages.	2023-01-28 19:36:20 +01:00
AudricV	1f4ed9dce9	[YouTube] Fix channel ID extraction of YouTube channel RSS feeds The yt:channelId element doesn't provide the channel ID anymore and is empty, like the id element, so we need now to extract it from the channel URL provided in two elements: author -> uri and feed -> link. Also avoid a NullPointerException in getUrl and getName methods.	2023-01-28 11:53:33 +01:00
Tobi	c589a2c1a2	Merge pull request #1014 from TeamNewPipe/fix/yt-comments [YouTube] Fix getting next comments pages	2023-01-27 11:14:55 +01:00
TobiGr	72573932cf	[YouTube] Fix NPE in search when getting channel items without subscriber count	2023-01-24 23:03:45 +01:00
TobiGr	f50b7275af	[YouTube] Fix getting next comments pages	2023-01-24 22:39:08 +01:00
Kunal	9bdad40b06	Removed topStandaloneBadge	2023-01-20 02:41:21 +05:30
Stypox	5945057227	[YouTube] Add music mix test	2023-01-15 23:30:30 +01:00
Stypox	7293991832	[YouTube] Now music mixes can be treated as normal mixes Using a playlist extractor on them would result in "Unviewable playlist" errors	2023-01-15 23:28:59 +01:00
Stypox	ff94e9f30b	Merge pull request #1009 from TeamNewPipe/dependabot/gradle/com.google.code.gson-gson-2.10.1 Bump gson from 2.10 to 2.10.1	2023-01-11 15:35:36 +01:00
Stypox	c1040bccac	Merge pull request #794 from FireMasterK/comments-count [YouTube] Add support to extract total comment count	2023-01-11 15:32:19 +01:00
dependabot[bot]	f43049985e	Bump gson from 2.10 to 2.10.1 Bumps [gson](https://github.com/google/gson) from 2.10 to 2.10.1. - [Release notes](https://github.com/google/gson/releases) - [Changelog](https://github.com/google/gson/blob/master/CHANGELOG.md) - [Commits](https://github.com/google/gson/compare/gson-parent-2.10...gson-parent-2.10.1) --- updated-dependencies: - dependency-name: com.google.code.gson:gson dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2023-01-09 09:05:55 +00:00
TobiGr	56aab4d971	[YouTube] Fix escaping links in YouTubeParsingHelper.getTextFromObject	2023-01-05 00:28:12 +01:00
Kavin	22a47da8c7	Fix requested change and remove outdated comment.	2023-01-02 20:42:32 +00:00
Kavin	98a90fd9c8	Don't cache comments count and return early on page fetch if no token.	2023-01-02 20:40:48 +00:00
Kavin	2974dfaa48	Only store ajaxJson for initial page and eager fetch the initial continuation.	2023-01-02 20:40:48 +00:00
Kavin	64d24aa09e	Fix request changes.	2023-01-02 20:40:48 +00:00
Kavin	67ef4f4c30	Cleanup and remove optional.	2023-01-02 20:40:48 +00:00
FireMasterK	22f71b010c	Fix for requested changes.	2023-01-02 20:40:48 +00:00
FireMasterK	656b7c1cd9	Improve method documentation.	2023-01-02 20:40:48 +00:00
FireMasterK	981aee4092	Add support to extract total comment count.	2023-01-02 20:40:48 +00:00
Stypox	45636b0d00	Merge pull request #986 from Isira-Seneviratne/Static_maps Use immutable Map factory methods.	2023-01-02 18:11:14 +01:00
Stypox	219c5c5be5	Update extractor/src/main/java/org/schabi/newpipe/extractor/services/youtube/YoutubeParsingHelper.java	2023-01-02 18:11:03 +01:00
Stypox	259de3cba6	Merge pull request #995 from TeamNewPipe/feat/soundcloud-playlistinfoitemextractor [SoundCloud] Implement getUploaderUrl() and isUploaderVerified() for PlaylistInfoItemExtractor	2023-01-02 15:10:40 +01:00
Stypox	991394b53a	Merge pull request #1005 from FireMasterK/fix-escaping-xss Fix for potential XSS attacks and formatting issues	2023-01-02 15:06:17 +01:00
Isira Seneviratne	d8ce08d969	Use immutable Map factory methods.	2023-01-02 07:50:31 +05:30
Kavin	01acf79436	Fix for potential XSS attacks.	2022-12-31 20:05:32 +00:00
TobiGr	292e0d8ce7	[SoundCloud] Implement getUploaderUrl() and isUploaderVerified() for PlaylistInfoItemExtractor	2022-12-31 18:46:39 +01:00
TobiGr	2a8729aeb2	Apply suggestions Co-authored-by: Stypox <stypox@pm.me>	2022-12-31 18:24:33 +01:00
TobiGr	d75a997611	[PeerTube] Support searching for channels	2022-12-31 18:24:33 +01:00
TobiGr	dea6d8ce4c	[PeerTube] Support searching for playlists	2022-12-31 18:24:33 +01:00
Stypox	95cc6aefbb	Merge pull request #994 from TeamNewPipe/fix/peertube-subtitles-exception [PeerTube] Report Exceptions thrown while getting a stream's subtitles	2022-12-31 15:01:39 +01:00
Stypox	7b54457789	Merge pull request #941 from TeamNewPipe/feat/peertube-comment-replies [PeerTube] Support comment replies	2022-12-31 14:57:51 +01:00
AudricV	f45966d449	Merge pull request #910 from Isira-Seneviratne/Locale_forLanguageTag Add compat Locale.forLanguageTag() implementation.	2022-12-24 23:53:30 +01:00
AudricV	d5437e0bc5	Merge pull request #863 from AudricV/add-content-type-and-content-length-headers-to-post-requests Add Content-Type header to all POST requests without an empty body	2022-12-16 19:32:56 +01:00
AudricV	0766b1d211	[YouTube] Improve YoutubeStreamInfoItemExtractor - Return duration of video premieres; - Add another non-localized method to determine whether a stream is a running livestream; - Return view count and upload date of videos in playlists; - Store isPremiere result; - Remove shorts workaround code, as it was only useful on channels and shorts have been moved into a separated channel tab; - Improve some other code.	2022-12-08 13:59:12 +01:00
Tobi	896d7e09eb	Merge pull request #978 from Theta-Dev/fix/search-channel-handles [YouTube] Fix search subscriber count extraction with channel handles	2022-12-05 17:52:05 +01:00
TobiGr	cd3262745d	[PeerTube] Report Exceptions thrown while getting a stream's subtitles	2022-12-03 16:11:21 +01:00
TobiGr	4e66b2287e	[PeerTube] Add support for comment replies	2022-12-01 14:05:18 +01:00
Tobi	41c8dce452	Merge pull request #992 from Isira-Seneviratne/String_isBlank Use String.isBlank().	2022-11-30 17:48:54 +01:00
Isira Seneviratne	2bca56f0df	Use String.isBlank().	2022-11-30 08:26:21 +05:30
Isira Seneviratne	3b80547976	Add code review suggestions.	2022-11-30 07:57:45 +05:30
ThetaDev	016623131e	docs: update comment in YoutubeChannelInfoItemExtractor	2022-11-29 19:06:03 +01:00
Kavin	2e08eaad96	Fix complication error in comment test.	2022-11-29 16:07:48 +00:00
Kavin	abf08e1496	Merge pull request #990 from FireMasterK/bold-italic-strikethrough [YouTube] Implement bold/italic/strike-through support	2022-11-29 15:59:38 +00:00
Kavin	57e7a6fb7c	Add mocks test.	2022-11-28 20:27:55 +00:00
Kavin	1d3d7fa5c3	Add test for formatting.	2022-11-28 20:26:37 +00:00
Kavin	52fda37915	Implement bold/italic/strike-through support.	2022-11-28 19:06:18 +00:00
Kavin	b566084cac	Use Description object for comments text.	2022-11-28 17:02:19 +00:00
Tobi	f8162b049d	Merge pull request #984 from FireMasterK/unused-dep Remove unused autolink dependency	2022-11-28 11:28:42 +01:00
Tobi	1da0190056	Merge pull request #980 from TeamNewPipe/fix/yt/unavailable [YouTube] Fix extracting the detailed error message for unavailable streams	2022-11-28 10:07:34 +01:00
Stypox	60fb30f835	Merge pull request #928 from FireMasterK/comment-urls Parse YouTube comments as HTML	2022-11-27 19:16:34 +01:00
Kavin	5abea22225	Fix throwing correct reason.	2022-11-26 21:09:08 +00:00
Kavin	faf28f5c11	Remove unused dependency.	2022-11-26 20:17:25 +00:00
Kavin	c043597255	Update supported countries list.	2022-11-26 19:01:33 +00:00
TobiGr	4680df0bdf	Fix throwing correct reason	2022-11-23 17:03:22 +01:00
TobiGr	9de8405c9f	[YouTube] Fix extracting the detailed error message of streams which are unavailable	2022-11-23 08:33:06 +01:00
Stypox	34d79bd267	[YouTube] Update mocks	2022-11-22 17:10:04 +01:00
AudricV	2ec296e674	Fix YoutubeSearchExtractorTest.MetaInfoTest Not all the "learn more" button is uppercase anymore, that's only the case for the first letter.	2022-11-22 16:34:54 +01:00
AudricV	3891542ca1	Use Downloader's postWithContentType and postWithContentTypeJson methods in services and extractors	2022-11-22 11:37:18 +01:00
AudricV	b2862f3cd1	Add postWithContentType and postWithContentTypeJson utility methods in Downloader Co-authored-by: Stypox <stypox@pm.me>	2022-11-22 11:37:17 +01:00
AudricV	e9a0d3bd95	[YouTube] Send Content-Type header in all POST requests This header was not sent partially before and was added and guessed by OkHttp. This can create issues when using other HTTP clients than OkHttp, such as Cronet. Some code in the modified classes has been improved and / or deduplicated, and usages of the UTF_8 constant of the Utils class has been replaced by StandardCharsets.UTF_8 where possible. Note that this header has been not added in except in YoutubeDashManifestCreatorsUtils, as an empty body is sent in the POST requests made by this class.	2022-11-22 11:37:16 +01:00
AudricV	b9e463de49	[Bandcamp] Send Content-Type header in POST requests This header was not sent before and was added and guessed by OkHttp. This can create issues when using other HTTP clients than OkHttp, such as Cronet. Also make use of StandardCharsets.UTF_8 when getting bytes of bodies instead of the platform default's charset, to make sure to prevent some encoding issues on some JVMs.	2022-11-22 11:35:46 +01:00
AudricV	65d6321e3d	Fix typos in Downloader.post JavaDocs Post methods in Downloader return the result of a POST request and not the one of a GET request.	2022-11-22 11:35:46 +01:00
ThetaDev	5daabd1793	fix: #976 search subscriber count extraction with channel handles	2022-11-22 02:17:10 +01:00
Kavin	c953e23414	Merge pull request #968 from AudricV/yt-support-no-video-info-renderers-for-streams [YouTube] Support lack of video info renderers for streams	2022-11-16 20:20:01 +00:00
Tobi	2211a24b69	Merge pull request #971 from lrusso96/patch-1 [YouTube] Improve duration parsing	2022-11-16 16:14:54 +01:00
Kavin	86f06b333a	Address review.	2022-11-14 00:05:31 +00:00
Kavin	b16e6082e1	Add test for audio stream languages.	2022-11-13 23:10:44 +00:00
Kavin	30909da1df	Fix audio track similar comparison for IDs.	2022-11-13 23:08:54 +00:00
Kavin	6d59cdbe3a	Add support for extracting audio tracks.	2022-11-13 21:39:29 +00:00
Isira Seneviratne	e4d982c7ea	Fix license.	2022-11-12 07:29:15 +05:30
Isira Seneviratne	416089146e	Fix failing tests.	2022-11-12 07:29:15 +05:30
Isira Seneviratne	ddbce3b83d	Add Utils methods for URL encoding/decoding using UTF-8.	2022-11-12 07:29:15 +05:30
Isira Seneviratne	366f5c1632	Use StandardCharsets.UTF_8.	2022-11-12 07:29:15 +05:30
dependabot[bot]	0b82fade51	Bump gson from 2.9.1 to 2.10 Bumps [gson](https://github.com/google/gson) from 2.9.1 to 2.10. - [Release notes](https://github.com/google/gson/releases) - [Changelog](https://github.com/google/gson/blob/master/CHANGELOG.md) - [Commits](https://github.com/google/gson/compare/gson-parent-2.9.1...gson-parent-2.10) --- updated-dependencies: - dependency-name: com.google.code.gson:gson dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2022-11-09 16:21:16 +00:00
Luigi Russo	c9635218e2	[YouTube] Improve duration parsing	2022-11-09 09:41:29 +01:00
Isira Seneviratne	316d8573fa	Use immutable sets in YoutubeParsingHelper.	2022-11-07 07:50:26 +05:30
AudricV	6a2c680d8f	[YouTube] Add mocks for YoutubeStreamExtractorDefaultTest.NoVisualMetadataVideoTest	2022-11-04 19:43:06 +01:00
AudricV	e66fed41d6	[YouTube] Add a StreamExtractor test for a video without visual metadata The video "Makani’s first commercial-scale energy kite" (video ID: An8vtD1FDqs), which has this behavior, is used for the new test, NoVisualMetadataVideoTest, added in YoutubeStreamExtractorDefaultTest. Tests of elements who throw an exception in this case (subscriber count, like count, uploader avatar URL) test if the ParsingException exception is thrown by YoutubeStreamExtractor.	2022-11-04 19:42:12 +01:00
AudricV	aa9a8ca23c	[YouTube] Make non-extraction of videoPrimaryInfoRenderer and/or videoSecondaryInfoRenderer not fatal Also de-duplicated common code related to the obtain of these video info renderers. This change allows extraction of videos without visual metadata.	2022-11-04 18:35:53 +01:00
AudricV	20cd8e8a4a	[YouTube] Update mocks of YoutubeChannelExtractorTest.Gronkh	2022-11-03 19:46:42 +01:00
AudricV	a34f060ba0	[YouTube] Use a handle for YoutubeChannelExtractorTest.Gronkh	2022-11-03 19:46:42 +01:00
AudricV	724f669ff7	[YouTube] Add tests for handles and user IDs with non ASCII characters support Unneeded public modifiers in test methods of YoutubeChannelLinkHandlerFactoryTest have been also removed.	2022-11-03 19:46:42 +01:00
AudricV	61ce041bda	[YouTube] Support handles and all custom channel names More non-channel paths have been also added to the excluded custom name paths, documentation and exception messages have been improved and fixed in some places, and the licence header of YoutubeChannelLinkHandlerFactory has been moved to its beginning and updated.	2022-11-03 19:46:42 +01:00
AudricV	ffffb04439	Merge pull request #953 from Theta-Dev/attributed-text-desc [YouTube] Add support for attributed text description	2022-11-03 18:34:30 +01:00
ThetaDev	592e1d6386	fix: parsing attributed description with no command runs	2022-11-03 12:10:52 +01:00
ThetaDev	099b53cc4f	[YouTube] Add parser for attributedDescription Also update the mock of the next InnerTube endpoint response of the YoutubeStreamExtractorDefaultTest.DescriptionTestUnboxing test class with an attributedDescription instead of a regular description	2022-11-02 23:11:33 +01:00
AudricV	e4c24d4c36	[YouTube] Regenerate supported channels mocks	2022-11-02 19:13:59 +01:00
Theta-Dev	20e4a35814	[YouTube] Support richGridRenderer on channel pages YouTube is deploying a new layout on their channel pages, which uses richGridRenderer JSON objects.	2022-11-02 19:01:29 +01:00
AudricV	4cae66f1f9	Merge pull request #946 from chowder/dev Add ability to identify short-form `StreamInfoItem`s	2022-11-01 12:19:58 +01:00
Tobi	eb40bb8458	Merge pull request #959 from FireMasterK/playlist-info-item-uploader Add uploaderUrl and uploaderVerified to PlaylistInfoItem.	2022-10-31 13:10:33 +01:00
chowder	b1a899fd47	Fix null pointer exception	2022-10-31 11:12:23 +00:00
Stypox	a4db106a66	Merge pull request #960 from AudricV/yt-workaround-403-errors-android-client [YouTube] Workaround 403 HTTP errors of ANDROID client streams	2022-10-30 21:53:26 +01:00
Kavin	b441910257	Mark uploaderUrl as nullable.	2022-10-30 13:36:04 +00:00
chowder	3fdc0e72cc	Line breaks for long docstrings	2022-10-30 13:28:39 +00:00
Kavin	6a256d0631	Add uploader url and verified to PlaylistInfoItem.	2022-10-30 13:00:19 +00:00
Tobi	430504b4b5	Merge pull request #958 from AudricV/yt-playlists-support-new-metadata-format [YouTube] Support new metadata format of playlists	2022-10-30 12:31:43 +01:00
Kavin	f9bd08c649	Address reviews.	2022-10-30 01:25:30 +00:00
Caleb	9282c3c13b	Fix exception message for YoutubeStreamInfoItemExtractor#isShortFormContent Co-authored-by: AudricV <74829229+AudricV@users.noreply.github.com>	2022-10-30 01:23:15 +00:00
Caleb	c5216f7c12	Update docstring for StreamExtractor#isShortFormContent Co-authored-by: AudricV <74829229+AudricV@users.noreply.github.com>	2022-10-30 01:23:15 +00:00
Caleb	04795fe5d2	Use Stream API for ShortFormContent#testShortFormContent Co-authored-by: AudricV <74829229+AudricV@users.noreply.github.com>	2022-10-30 01:23:15 +00:00
chowder	975b38a0b9	Regenerate test mocks	2022-10-30 01:23:15 +00:00
chowder	4cccd33f3d	Implement isShortFormContent for StreamExtractor and StreamInfo	2022-10-30 01:23:15 +00:00
chowder	09544ceb23	Fix tests	2022-10-30 01:23:15 +00:00
chowder	644cc6cd76	Rename test function name	2022-10-30 01:23:15 +00:00
chowder	daf5674951	Add ability to identify short-form StreamInfoItems	2022-10-30 01:23:12 +00:00
Tobi	3d314169b9	Merge pull request #943 from TeamNewPipe/fix/sc/comments [SoundCloud] Fix getting more comments	2022-10-29 22:19:50 +02:00
AudricV	65125a3eb4	[YouTube] Fix YoutubePlaylistExtractorTest.LearningPlaylist The video count is now returned for this playlist, so it isn't unknown. The testStreamCount method of this test class asserts now that the stream count is greater than 40.	2022-10-29 18:12:10 +02:00
AudricV	7258a53225	[YouTube] Support new playlist layout This new layout doesn't provide author thumbnails and is completely different for metadata, so the code to get them has been refactored. The code of learning playlists video count check has been also removed, as it seems to be not relevant anymore (the video count seems to be returned for these playlists with both layouts). Finally, unneeded overrides of subchannel methods, which don't apply to the YouTube service, have been removed.	2022-10-29 18:12:10 +02:00
AudricV	e37e8b358e	[YouTube] Update mocks that need to be updated	2022-10-29 18:10:18 +02:00
AudricV	60e97cd274	[YouTube] Workaround getting streaming URLs returning 403 HTTP response codes Using the player parameters used to get stories seems to fix the issue, which affects currently only certain countries such as UK. This is a workaround and should be fixed in a better way (by changing the InnerTube additional client used for videos or finding what is now required in Android player requests).	2022-10-29 17:58:33 +02:00
AudricV	c230d84df1	Fix Checkstyle error in YoutubeCommentsInfoItemExtractor	2022-10-29 13:24:19 +02:00
xz-dev	0ffcb32d9c	[YouTube] Add comment reply count support (#936 ) Add comment reply count support for YouTube and introduce `CommentsInfoItem.UNKNOWN_REPLY_COUNT` constant Co-authored-by: AudricV <74829229+AudricV@users.noreply.github.com> Co-authored-by: Tobi <TobiGr@users.noreply.github.com>	2022-10-15 12:40:06 +02:00
Isira Seneviratne	b90a566dd8	Add backport implementation of Locale.forLanguageTag().	2022-10-12 09:21:39 +05:30
Isira Seneviratne	b232c29d22	Use Locale.forLanguageTag().	2022-10-12 09:21:38 +05:30
TobiGr	4d136599bd	[SoundCloud] Fix getting more comments	2022-10-11 15:44:54 +02:00
Tobi	a822e91909	Merge pull request #939 from TurtleArmyMc/fix/SoundcloudPlaylistExtractor_track_order Fix SoundcloudPlaylistExtractor: tracks are in correct order	2022-10-10 22:28:18 +02:00
TobiGr	02810a7db7	Add a comment	2022-10-10 22:22:12 +02:00
Tobi	54092fc3c7	Merge pull request #930 from Isira-Seneviratne/Avoid_Comparator_NPE Avoid possible NullPointerException in MediaCCCRecentKiosk.	2022-10-10 11:43:34 +02:00
TurtleArmyMc	bf70d32eb4	Fix SoundcloudPlaylistExtractor: tracks are in correct order	2022-09-30 15:26:25 -04:00
AudricV	8067c43837	[YouTube] Don't use a specific letter for the decryption function name pattern Use the same possible characters for variables everywhere, in order to avoid potential future throttling parameter decryption function name parsing issues related to the usage of other letter(s) than b.	2022-09-24 21:49:22 +02:00
AudricV	abcee87167	[YouTube] Fix throttling parameter decryption function regex - Quote the function name, as it may contain special regex symbols, such as dollar; - Support multiple lines; - Use what looks like the end of the function for the end of the regex (this part is inspired from yt-dlp throttling parameter decryption regex); - Move the throttling function body regex into a private and static constant.	2022-09-24 21:28:09 +02:00
ThetaDev	7244be7627	[YouTube] Add JavaScript lexer to parse completely throttling decryption function (#905 )	2022-09-24 19:55:46 +02:00
Isira Seneviratne	0e31c86aee	Avoid possible NullPointerException in MediaCCCRecentKiosk.	2022-09-21 05:40:07 +05:30
Stypox	a99af9bb6e	Merge pull request #927 from Stypox/fix-feed-thumbnail [YouTube] Return mqdefault thumbnails in fast feed	2022-09-19 08:52:01 +02:00
Kavin	4e14644c41	Ensure comments are parsed with URLs and timestamps.	2022-09-17 16:03:39 +05:30
Stypox	93ef73e2f2	[YouTube] Return mqdefault thumbnails in feed The hqdefault thumbnails, which are used by default, have black bars at the top and at the bottom, and are thus not optimal.	2022-09-13 16:46:54 +02:00
AudricV	d8ddeb549c	[YouTube] Support new trending structure and filter recently trending section This new structure allow us to filter easily Trending shorts and Recently trending sections. On the previous one, this Recently trending section is now filtered, by checking whether sections have a title, which isn't the case for normal trends contrary to the other ones. This makes that the extractor returns now only the real 50 "Now" YouTube trends. Elements inside arrays are now extracted dynamically instead of only the ones of the first index, using Java 8's Stream API. The getInitialPage() method of YoutubeTrendingExtractor can now throw a ParsingException if no selected tab (corresponding to the one of the trends type extracted) has been found. Finally, the licence header has been moved to the top of the file and updated.	2022-09-11 18:52:12 +02:00
AudricV	884da40f65	Merge pull request #926 from AudricV/fix-yt-like-count-extraction [YouTube] Fix extraction of like count with the new data model	2022-09-11 12:21:38 +02:00
AudricV	119b9129e2	[YouTube] Fix extraction of like count with the new data model In the new data model currently A/B tested or deployed, the like count button data is the same, but its path has been changed. The extraction of the like count has been also improved, by using the multiple accessibility data available instead of the only one which was used before. This accessibility data has been also deprioritized, because it is language dependent when there is no view. Like count is always known, even for age-restricted videos, so returning -1 if the like count extraction fails on this type of videos is not needed and hides an extraction error: that's the reason why this return has been removed and the exception will always be thrown, even if a video is age-restricted.	2022-09-10 23:28:38 +02:00
ThetaDev	4905f74700	[YouTube] Fix throttling parameter decryption on Android Escape the curly brace in the regular expression used to parse the throttling parameter decryption function to allow its compatibility on Android.	2022-09-10 17:03:39 +02:00
Stypox	fa25a7280f	Downgrade Rhino since it uses a class not available on Android Reverts #774, see TeamNewPipe/NewPipe#8876	2022-08-25 09:49:13 +02:00
Stypox	25f0ce7305	Merge pull request #911 from TeamNewPipe/dependabot/gradle/org.jsoup-jsoup-1.15.3 Bump jsoup from 1.15.2 to 1.15.3	2022-08-24 14:23:44 +02:00
dependabot[bot]	1d7a3a90a4	Bump jsoup from 1.15.2 to 1.15.3 Bumps [jsoup](https://github.com/jhy/jsoup) from 1.15.2 to 1.15.3. - [Release notes](https://github.com/jhy/jsoup/releases) - [Changelog](https://github.com/jhy/jsoup/blob/master/CHANGES) - [Commits](https://github.com/jhy/jsoup/compare/jsoup-1.15.2...jsoup-1.15.3) --- updated-dependencies: - dependency-name: org.jsoup:jsoup dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2022-08-24 09:14:26 +00:00
Isira Seneviratne	41254ae12a	Remove annotation.	2022-08-24 06:59:17 +05:30
Isira Seneviratne	943b7c033b	Remove EMPTY_STRING.	2022-08-24 06:59:17 +05:30
litetex	d6577e5e0b	Merge pull request #882 from litetex/fix-all-tests Fix all tests	2022-08-23 16:19:27 +02:00
AudricV	03d9a4fe9d	Fix typos in YoutubeStreamExtractor.tryDecryptUrl	2022-08-21 21:17:03 +02:00
litetex	0beb55a232	Improve ``YoutubeThrottlingDecrypterTest``	2022-08-21 20:17:05 +02:00
litetex	9e93d6b193	YoutubeThrottlingDecrypter: Check if returned string is a valid JavaScript function	2022-08-21 20:16:45 +02:00
litetex	ed0a07af4e	YoutubeThrottlingDecrypter: Patch regex	2022-08-21 20:15:53 +02:00
litetex	641a447d9b	Updated consent-related mock-data	2022-08-21 18:43:48 +02:00
litetex	8ff7a90f52	Improved consent cookie related constants and documentation	2022-08-21 18:41:40 +02:00
AudricV	cb64a480cd	[YouTube] Remove deprecated code in YoutubeThrottlingDecrypter YoutubeThrottlingDecrypter is now a non-instantiable final class and non-static attributes have been removed. The static attributes of this class have been renamed, in order to respect the naming specification used.	2022-08-16 15:01:05 +02:00
litetex	5bbea3a8f2	Fix ``YoutubeMixPlaylistExtractorTest`` again Doesn't work for kurgesagt videos anymore, used music video (copyright free) instead	2022-08-14 15:12:27 +02:00
litetex	da06166065	Updated mock data of ``YoutubeSearchExtractorTest$Suggestion``	2022-08-14 15:12:26 +02:00
litetex	2e36ab1578	Fixed ``YoutubeSearchExtractorTest$Suggestion``	2022-08-14 15:12:26 +02:00
litetex	2a8a623643	``YoutubePlaylistLinkHandlerFactoryTest``: Use parameterized tests	2022-08-14 14:48:31 +02:00
litetex	938e69a16a	Fixed ``YoutubePlaylistLinkHandlerFactoryTest``	2022-08-14 14:48:30 +02:00
litetex	844de3e378	Fix ``YoutubeStreamLinkHandlerFactoryTest `` and parameterized the tests used code from #833	2022-08-14 14:48:30 +02:00
litetex	c1a72b8240	Use Android Studios code style	2022-08-14 14:48:29 +02:00
litetex	fc27b8a5b8	Use preexisting ``ContentNotAvailableException``	2022-08-14 14:48:29 +02:00
Stypox	17bad6cedf	Fix test about exception type: NPE, not IllegalArgment The relevant change was made in #877	2022-08-14 14:48:28 +02:00
litetex	ecfc370685	Fixed all YTMixPlaylists Added option to choose if you want to consent or not - currently this is done by a static variable in ``YoutubeParsingHelper`` - may not be the best long-term solution but for now the tests work again (in EU countries) 🥳	2022-08-14 14:48:27 +02:00
litetex	2b6fe294b2	Fixed ``YoutubeMusicSearchExtractorTest`` The renderers and queries got changed.	2022-08-14 14:48:27 +02:00
litetex	d6586da614	Failing test works locally without any problems. Reformatted it a bit.	2022-08-14 14:48:27 +02:00
litetex	bf3ae5e679	Fixed ``SoundcloudStreamLinkHandlerFactoryTest`` * Removed the unnecessary ``public`` * Use parameterized tests * One song got removed (which caused the test failure), replaced it with another song	2022-08-14 14:48:27 +02:00
litetex	504f81036e	Fixed ``PeertubePlaylistExtractorTest`` Also removed the unnecessary ``public`` and reformatted the code a bit	2022-08-14 14:48:26 +02:00
litetex	e7c12258f4	Fixed ``BandcampSearchExtractorTest`` Also removed the unnecessary ``public``	2022-08-14 14:48:26 +02:00
litetex	9e8724df4d	Fixed ``YoutubeStreamExtractorLivestreamTest`` The "Lofi Girl"-stream got interrupted by a copyright strike and had to be restarted. Because of this a new id is now used.	2022-08-14 14:48:26 +02:00
AudricV	472f5d9e9c	[YouTube] Update mocks	2022-08-12 19:20:32 +02:00
AudricV	7bdca33a87	[YouTube] Ensure that an additional player response is the correct one If YouTube detect that requests come from a third party client, they may replace the real player response by another one of a video saying that this content is not available on this app and to watch it on the latest version of YouTube. We can detect this by checking whether the video ID of the player response returned is the same as the one requested by the extractor.	2022-08-12 19:20:31 +02:00
AudricV	c82317e318	[YouTube] Spoof more mobile clients Additional parameters have been added to the player requests of ANDROID and IOS clients: - for both clients: osName and osVersion: their respective values are: - for the ANDROID one: Android and 12; - for the IOS one: iOS and 15.6.0.19G71. - for the ANDROID client: androidTargetSdkVersion, with the Android SDK version corresponding to the Android version used in the player requests of this client. This parameter is now required with this client to be sure to get a correct player response, otherwise, the one of a video saying that this content is not available in this app and to watch it with the latest version of YouTube can be returned instead; - for the IOS client: deviceMake, with Apple as its value. The iOS version sent in the IOS client player requests has been also updated to the version 15.6 of the OS. Finally, a comment about the requirement to use the signature timestamp from the player JavaScript base file for HTML5 player requests on videos with obfuscated URLs has been added and replaces a previous one which may be not true.	2022-08-12 19:20:31 +02:00
AudricV	d0549a5a52	[YouTube] Update client versions and use a real version for the iOS client The iOS version can be got easily in fact, by looking at the What's New section of the App Store' app page.	2022-08-12 19:20:31 +02:00
AudricV	d7e678aca2	[YouTube] Improve WEB client version and API key HTML extraction Common code in WEB client version HTML extraction has been deduplicated, usage of the Java 8 Stream API has been made and initial data fallback has been used as a last resort. This means that the client version extraction from regexes will be used before this fallback, as it doesn't contain the full client version. This can be used as a way to fingerprint the extractor, even if it seems to be not the case.	2022-08-12 19:20:30 +02:00
AudricV	5b548340e8	[YouTube] Catch any exception in YoutubeThrottlingDecrypter.apply and improve docs This will prevent any future extractor break due to decryption failure, like it was excepted to be the case before. Some documentation about the throttling decryption has been also improved.	2022-08-12 17:49:36 +02:00
ThetaDev	52ded6e3d7	Handle curly braces inside strings in StringUtils.matchToClosingParenthesis This is required to extract fully more complex YouTube nsig functions.	2022-08-12 16:32:00 +02:00
Stypox	d12003651b	Merge pull request #886 from Isira-Seneviratne/toArray_improvements Make improvements to methods using toArray().	2022-08-06 22:34:23 +02:00
Isira Seneviratne	7daca10a06	Make improvements to methods using toArray().	2022-08-06 05:21:12 +05:30
Stypox	2906be22af	Merge pull request #881 from Isira-Seneviratne/String_join Use String.join() and Collectors.joining().	2022-08-04 12:09:33 +02:00
Stypox	4ddb96a86f	Merge pull request #883 from TeamNewPipe/dependabot/gradle/com.google.code.gson-gson-2.9.1 Bump gson from 2.9.0 to 2.9.1	2022-08-04 11:20:41 +02:00
Isira Seneviratne	64771c5712	Use String.join() and Collectors.joining().	2022-08-04 05:18:13 +05:30
Stypox	fc8b5ebbc6	Merge pull request #878 from Isira-Seneviratne/Use_Collections Use Collections methods.	2022-08-03 22:50:41 +02:00
dependabot[bot]	325af31e5f	Bump gson from 2.9.0 to 2.9.1 Bumps [gson](https://github.com/google/gson) from 2.9.0 to 2.9.1. - [Release notes](https://github.com/google/gson/releases) - [Changelog](https://github.com/google/gson/blob/master/CHANGELOG.md) - [Commits](https://github.com/google/gson/compare/gson-parent-2.9.0...gson-parent-2.9.1) --- updated-dependencies: - dependency-name: com.google.code.gson:gson dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2022-08-01 09:26:14 +00:00
Isira Seneviratne	1af6b8eedb	Use Collections.singletonList().	2022-07-27 07:35:57 +05:30
Isira Seneviratne	ff60e05c76	Use Collections.singletonMap().	2022-07-27 07:35:57 +05:30
Isira Seneviratne	682a4263e5	Use Objects.requireNonNull().	2022-07-27 06:55:26 +05:30
Stypox	954a294e27	Merge pull request #870 from TeamNewPipe/dependabot/gradle/org.jsoup-jsoup-1.15.2 Bump jsoup from 1.15.1 to 1.15.2	2022-07-13 17:48:06 +02:00
Stypox	5ab74b3631	Merge pull request #857 from FireMasterK/video-title Get original untranslated title for YouTube	2022-07-06 10:26:45 +02:00
dependabot[bot]	122365005a	Bump jsoup from 1.15.1 to 1.15.2 Bumps [jsoup](https://github.com/jhy/jsoup) from 1.15.1 to 1.15.2. - [Release notes](https://github.com/jhy/jsoup/releases) - [Changelog](https://github.com/jhy/jsoup/blob/master/CHANGES) - [Commits](https://github.com/jhy/jsoup/compare/jsoup-1.15.1...jsoup-1.15.2) --- updated-dependencies: - dependency-name: org.jsoup:jsoup dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2022-07-04 09:38:16 +00:00
AudricV	090debd83b	[YouTube] Fetch the ANDROID client for ended/post livestreams The ANDROID client was only fetched for video contents, where it can be useful on ended/post livestreams, if the n parameter of the WEB client cannot be decrypted, to avoid throttling issues (because the WEB client was only used before for ended/post livestreams). It also provides an exclusive 48kbps M4A audio format in the adaptiveFormats array of the JSON player response, like other mobile clients (which can be also extracted from the response of the DASH manifest URL returned into the WEB client player's response, but the DASH manifest is not used by the extractor). A note about non-fatality of fetching or parsing issues of the ANDROID and IOS clients has been added.	2022-06-21 18:53:49 +02:00
dependabot[bot]	281d2b9f81	Bump jsoup from 1.14.3 to 1.15.1 Bumps [jsoup](https://github.com/jhy/jsoup) from 1.14.3 to 1.15.1. - [Release notes](https://github.com/jhy/jsoup/releases) - [Changelog](https://github.com/jhy/jsoup/blob/master/CHANGES) - [Commits](https://github.com/jhy/jsoup/compare/jsoup-1.14.3...jsoup-1.15.1) --- updated-dependencies: - dependency-name: org.jsoup:jsoup dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2022-06-19 13:15:40 +00:00
litetex	f775155d25	Merge pull request #846 from litetex/remove-unused-methods Remove unused methods	2022-06-19 15:12:15 +02:00
Gábor Lipták	28c9340d69	Correct unit tests	2022-06-18 11:18:38 -04:00
AudricV	301a795ed3	[SoundCloud] Remove completely workaround for HLS streams SoundCloud is currently removing this workaround completely, so there is no need to keep it, because it impacts the loading time (a HLS playlist was downloaded and parsed).	2022-06-16 12:12:54 +02:00
AudricV	e960a417ec	[YouTube] Fix extraction of fps, audioSampleRate and audioChannels fields for ItagItems of live streams and post live streams These values were only set before for video streams. A fallback for the audio channels count has been added, in order to prevent exceptions when generating DASH manifests of audio streams: the fallback value is 2, because most audio streams on YouTube have 2 audio channels.	2022-06-16 12:12:54 +02:00
Kavin	7635aeed2c	Get original untranslated title for YouTube.	2022-06-02 09:57:52 +01:00
TiA4f8R	287d1dfd63	[SoundCloud] Use the HLS delivery method for all streams and extract only a single stream URL from HLS manifest for MP3 streams SoundCloud broke the workaround used to get a single file from HLS manifests for Opus manifests, but it still works for MP3 ones. The code has been adapted to prevent an unneeded request (the one to the Opus HLS manifest) and the HLS delivery method is now used for SoundCloud MP3 and Opus streams, plus the progressive one (for tracks which have a progressive stream (MP3) and for the ones which doesn't have one, it is still used by trying to get a progressive stream, using the workaround). Streams extraction has been also moved to Java 8 Stream's API and the relevant test has been also updated.	2022-05-29 19:08:18 +02:00
Stypox	b3c620f0d8	Apply code review and Streams rework	2022-05-28 12:00:58 +02:00
Stypox	d652e05874	[MediaCCC] Fix comments about containsSimilarStream	2022-05-28 12:00:58 +02:00
Stypox	044639c32b	Solve some review comments	2022-05-28 12:00:57 +02:00
litetex	c33d392958	Fixed typo XEE → XXE (Xml eXternal Entity attack) See also: https://en.wikipedia.org/wiki/XML_external_entity_attack https://owasp.org/www-community/vulnerabilities/XML_External_Entity_(XXE)_Processing	2022-05-28 12:00:57 +02:00
TiA4f8R	fffbbee7f3	[YouTube] Add missing Nonnull annotations in getCache method of YouTube DASH manifest creators	2022-05-28 12:00:56 +02:00
TiA4f8R	f7b1515290	[YouTube] Refactor DASH manifests creation Move DASH manifests creation into a new subpackage of the YouTube package, dashmanifestcreators. This subpackage contains: - CreationException, exception extending Java's RuntimeException, thrown by manifest creators when something goes wrong; - YoutubeDashManifestCreatorsUtils, class which contains all common methods and constants of all or a part of the manifest creators; - a manifest creator has been added per delivery type of YouTube streams: - YoutubeProgressiveDashManifestCreator, for progressive streams; - YoutubeOtfDashManifestCreator, for OTF streams; - YoutubePostLiveStreamDvrDashManifestCreator, for post-live DVR streams (which use the live delivery method). Every DASH manifest creator has a getCache() static method, which returns the ManifestCreatorCache instance used to cache results. DeliveryType has been also extracted from the YouTube DASH manifest creators part of the extractor and moved to the YouTube package. YoutubeDashManifestCreatorTest has been updated and renamed to YoutubeDashManifestCreatorsTest, and YoutubeDashManifestCreator has been removed. Finally, several documentation and exception messages fixes and improvements have been made.	2022-05-28 12:00:56 +02:00
TiA4f8R	f17f7b9842	Apply requested changes in YoutubeParsingHelper	2022-05-28 12:00:55 +02:00
TiA4f8R	301b9fa024	Remove hashCode and equals methods overrides of Stream classes	2022-05-28 12:00:55 +02:00
TiA4f8R	2f3920c648	[YouTube] Return approxDurationMs value from YouTube's player response in ItagItems generated This change allows to build DASH manifests using YoutubeDashManifestCreator with the real duration of streams and prevent potential cuts of the end of progressive streams, because the duration in YouTube's player response is in seconds and not milliseconds.	2022-05-28 12:00:54 +02:00
TiA4f8R	4158fc46a0	[Bandcamp] Fix regression of Opus radio streams extraction When moving opus-lo into a constant, opus-lo was renamed to opus_lo and was only used if no MP3 stream was available (which was not the case before the changes in BandcampRadioStreamExtractor related to the addition of the support of all delivery methods), so these changes removed the ability to get Opus streams of Bandcamp radios. This commit reverts this unwanted change.	2022-05-28 12:00:54 +02:00
TiA4f8R	54d323c2ae	Fix Checkstyle issue in YoutubeDashManifestCreator	2022-05-28 12:00:53 +02:00
Stypox	2321822844	Rename Stream's baseUrl to manifestUrl	2022-05-28 12:00:53 +02:00
Stypox	cfc13f4a6f	[YouTube] Reduce exception generation code and move several attributes of MPD documents into constants Rename YoutubeDashManifestCreationException to CreationException. Also use these constants in YoutubeDashManifestCreatorTest.	2022-05-28 12:00:53 +02:00
Stypox	00bbe5eb4d	[YouTube] Make exception messages smaller	2022-05-28 12:00:52 +02:00
Stypox	4da05afe11	[YouTube] Inline collectSegmentsData in YoutubeDashManifestCreator	2022-05-28 12:00:52 +02:00
Stypox	3708ab9ed5	[YouTube] Refactor YoutubeDashManifestCreator - Remove all of the methods used to access caches and replace them with three caches getters - Rename caches to shorter and more meaningful names - Remove redundant @throws tags that just say "if this method fails to do what it should do", which is obvious	2022-05-28 12:00:51 +02:00
Stypox	5c83409039	[YouTube] Rewrite manifest test and rename long methods	2022-05-28 12:00:51 +02:00
Stypox	8226fd044f	[YouTube] Suppress Sonar security warning for XEE	2022-05-28 12:00:50 +02:00
Stypox	ba68b8c014	[YouTube] Secure DashManifestCreator against XEE attack Also remove duplicate lines from javadoc comments, otherwise checkstyle complaints. XEE prevention is inspired from https://github.com/ChuckerTeam/chucker/pull/201/files For an explanation of XEE see https://rules.sonarsource.com/java/RSPEC-2755, though the solution reported there causes crashes on Android	2022-05-28 12:00:50 +02:00
Stypox	159d05c91b	Remove unused DashMpdParser (but kept in git history)	2022-05-28 12:00:49 +02:00
Stypox	50272db946	Apply reviews: improve comments, remove FILE, remove Stream#equals(Stream)	2022-05-28 12:00:49 +02:00
TiA4f8R	07b045f20d	[YouTube] Support the iOS client in YoutubeDashManifestCreator and decrypt again the n parameter, if present, for all clients This commits reverts a new behavior introduced in this branch, which only applied the decryption if needed on streams from the WEB client. Also fix rebase issues and documentations style in YoutubeDashManifestCreator.	2022-05-28 12:00:48 +02:00
TiA4f8R	436ddde29f	Use assertThrows in YoutubeDashManifestCreatorTest	2022-05-28 12:00:48 +02:00
TiA4f8R	d64d7bbd01	Move ManifestCreatorCache tests to a separate class and remove override of equals and hashCode methods in ManifestCreatorCache These methods don't need to be overriden, as they are not excepted to be used in collections. Also improve the toString method of this class, which contains also now clearFactor and maximumSize attributes and for each operations.	2022-05-28 12:00:48 +02:00
TiA4f8R	2fb1a412a6	Fix Checkstyle issues, revert resolution string changes for YouTube video streams and don't return the rn parameter in DASH manifests This parameter is still used to get the initialization sequence of OTF and POST-live streams, but is not returned anymore in the manifests. It has been removed in order to avoid fingerprinting based on the number sent (e.g. when starting to play a stream close to the end and using 123 as the request number where it should be 1) and should be added dynamically by clients in their requests. The relevant test has been also updated. Checkstyle issues in YoutubeDashManifestCreator have been fixed, and the changes in the resolution string returned for video streams in YoutubeStreamExtractor have been reverted, as they create issues on NewPipe right now.	2022-05-28 12:00:47 +02:00
TiA4f8R	f61e2092a1	[YouTube] Return a copy of the hardcoded ItagItem instead of returning the reference to the hardcoded one in ItagItem.getItag To do so, a copy constructor has been added in the class. This fixes, for instance, an issue in NewPipe, in which the ItagItem values where not the ones corresponsing to a stream but to another, when generating DASH manifests.	2022-05-28 12:00:47 +02:00
TiA4f8R	aa4c10e751	Improve documentation and adress most of the requested changes Also fix some issues in several places, in the code and the documentation.	2022-05-28 12:00:46 +02:00
TiA4f8R	6985167e63	Add tests for YoutubeDashManifestCreator and ManifestCreatorCache The test added in YoutubeDashManifestCreator uses a video of the Creative Commons channel, licenced under the Creative Commons Attribution licence (reuse allowed). Also remove public keywords of tests in UtilsTest, as suggested by SonarLint, because they are not needed with Junit 5.	2022-05-28 12:00:46 +02:00
TiA4f8R	f6ec7f9a61	Update DefaultStreamExtractorTest and SoundcloudStreamExtractorTest to support changes made in Stream classes	2022-05-28 12:00:45 +02:00
TiA4f8R	7477ed0f3d	[YouTube] Add ability to generate manifests of progressive, OTF and post live streams A new class has been added to do so: YoutubeDashManifestCreator. It relies on a new class: ManifestCreatorCache, to cache the content, which relies on a new pair class named Pair. Results are cached and there is a cache per delivery type, on which cache limit, clear factor, clearing and resetting can be applied to each cache and to all caches. Look at code changes for more details.	2022-05-28 12:00:45 +02:00
TiA4f8R	a857684442	Apply changes in YoutubeStreamExtractor Extract post live DVR streams as post live streams instead of live streams. A new class has been in order to improve code: ItagInfo, which stores an itag, the content (URL) extracted and if its an URL or not. A functional interface has been added in order to abstract the stream building: StreamBuilderHelper. Also add the cver parameter added by the desktop web client on the corresponding streams (a new method has been added in YoutubeParsingHelper to check this and another for Android streams). Some code in these classes has been also refactored/improved/optimized.	2022-05-28 12:00:44 +02:00
TiA4f8R	4330b5f7be	Add POST_LIVE_STREAM and POST_LIVE_AUDIO_STREAM stream types This allows the extractor to determine if a content is an ended audio or video livestream.	2022-05-28 12:00:43 +02:00
TiA4f8R	881969f1da	Apply changes in all StreamExtractors except YouTube's one and fix extraction of PeerTube audio streams as video streams Some code in these classes has been also refactored/improved/optimized. Also fix the extraction of PeerTube audio streams as video streams, which are now returned as audio streams.	2022-05-28 12:00:43 +02:00
TiA4f8R	d5f3637fc3	[YouTube] Return more values returned inside the ItagItems of the player response and deprecate use of public audio and video fields These fields can be now replaced by a getter and a setter. New fields have been added and will allow the creation of DASH manifests for OTF and ended livestreams. There are: - contentLength; - approxDurationMs; - targetDurationSec; - sampleRate; - audioChannels.	2022-05-28 12:00:42 +02:00
TiA4f8R	7c67d46e09	Move DashMpdParser to the YouTube package and fix extraction of streams DashMpdParser is only working with YouTube streams, as it uses the ItagItem class. Also update creation of AudioStreams and VideoStreams objects.	2022-05-28 12:00:41 +02:00
TiA4f8R	ad993b920f	Remove fetching of the DASH manifest extracted when getting information of a content with StreamInfo DashMpdParser is only working with YouTube streams, as it uses the ItagItem class. Also improve code and comments of StreamInfo (especially final use where possible).	2022-05-28 12:00:41 +02:00
TiA4f8R	2f061b8dbd	Add support of other delivery methods than progressive HTTP in Stream classes Stream constructors are now private and streams can be constructed with new Builder classes per stream class. This change has been made to prevent creating and using several constructors in stream classes. Some default cases have been also added in these Builder classes, so not everything has to be set, depending of the service and the content.	2022-05-28 12:00:27 +02:00
TiA4f8R	c34b5e3a8b	[YouTube] Fix extraction of YouTube Music client version and API key when using YouTube Music's website in EU Google returns now the consent page of YouTube for YouTube Music in EU, which can be also avoided by adding the ucbcb parameter to the URL with the value 1 ("?ucbcb=1").	2022-05-15 11:20:06 +02:00
litetex	2015eb374a	Removed more unused methods	2022-05-09 21:05:03 +02:00
litetex	f69b0ff77b	Remove unused methods	2022-05-09 20:59:25 +02:00
TiA4f8R	3c3cd78676	Remove Checkstyle suppressions file and fix Checkstyle issues introduced in `24e8399` and `8c1041d` The Checkstyle suppressions file is now replaced by // CHECKSTYLE:OFF and // CHECKSTYLE:ON comments.	2022-05-02 21:51:25 +02:00
Stypox	2e1c5c119d	Merge pull request #822 from Stypox/more-refactors More refactors	2022-05-02 19:03:54 +02:00
Stypox	598ebb92ea	Merge pull request #839 from TeamNewPipe/bandcamp/extract-length Bandcamp: extract stream length	2022-05-02 15:49:41 +02:00
litetex	5db4d1faf3	Merge pull request #782 from litetex/cleanup-yt-stream-extractor Cleanup of ``YoutubeStreamExtractor`` and some related classes	2022-05-01 16:44:11 +02:00
litetex	fe30eb43a9	Cleanup ``YoutubeStreamExtractor`` and some related classes * Fixed obvious sonar(lint) warnings * Abstracted some code (getStreams) Used some new lines to make code better readable * Chopped down brace-jungle in some methods * Use StandardCharset (Java 8 4tw)	2022-05-01 16:39:07 +02:00
Stypox	c2b5370517	Apply suggestions: improve switch and use EMPTY_STRING	2022-04-30 16:39:51 +02:00
Stypox	7c78c39230	Merge pull request #821 from litetex/cleanup-TimeAgoParser-java Cleanup ``TimeAgoParser``	2022-04-30 16:20:09 +02:00
TiA4f8R	9f9af35adb	[YouTube] Fix regression introduced in the order of streams used when adding more parameters to InnerTube requests, using the iOS client for livestreams and more	2022-04-25 20:23:04 +02:00
Fynn Godau	c38c016de5	Bandcamp: extract stream length	2022-04-24 21:24:19 +02:00
Nickoriginal	b5f3f9eb90	Update USER_AGENT to match the latest Firefox	2022-04-24 00:27:23 +03:00
Stypox	52fa2d939a	Fix javadoc formatting error causing deployment to fail	2022-04-16 17:07:07 +02:00
Stypox	dcb7483dcf	Fix YouTube throttling decrypter function parsing	2022-04-15 13:10:19 +02:00
TiA4f8R	ef49cd0007	[YouTube] Extract subtitles for age-restricted videos Subtitles of age-restricted videos can be extracted since the InnerTube API migration, so there is no reason to not extract them anymore.	2022-04-11 22:09:56 +02:00
TiA4f8R	b30e341559	Add link to learn more about the issue which makes YoutubeStreamExtractorRelatedMixTest.testRelatedItems test disabled	2022-04-04 20:36:59 +02:00
TiA4f8R	70812fa611	Update YouTube stream mocks and disable YoutubeStreamExtractorRelatedMixTest Mixes seems to be not given by YouTube anymore if you use a PENDING consent cookie value. As mocks needs to updated, the test is always failing because of this change.	2022-04-02 19:28:49 +02:00
TiA4f8R	67288a0191	[YouTube] Fix extraction of embeddable age-restricted videos, fix extraction of contents with warnings and more Use the TV embedded client technique to get streams of embeddable age-restricted videos. This client doesn't provide the playerMicroFormatRenderer object in the player response, but it is still returned on the WEB player response, even for unavailable (but non-private) contents, so we need now to store it, as we are replacing the player response from the WEB client by the TV embedded one. Otherwise, some metadata such as the unlisted property, category, the uploadDate and the publishDate properties. The outdated code for these contents has been removed. Add the racyCheckOk and contentCheckOk to player and next requests to the InnerTube API. The first doesn't seem to make any difference when used anonymously, but the second one is needed to get streams of contents with a warning before they can be played. Also apply some requested changes, fixes and improvements in YoutubeParsingHelper and YoutubeStreamExtractor.	2022-04-02 19:06:36 +02:00
TiA4f8R	11b5a222c4	Deduplicate code of getStringResultFromRegexArray methods in Utils Also revert indentation in Utils.mixedNumberWordToLong.	2022-04-02 18:40:00 +02:00
TiA4f8R	9ca647a750	Update mocks	2022-03-27 22:10:59 +02:00
TiA4f8R	dfa4239661	Fix missing imports and Checkstyle issues	2022-03-27 22:10:57 +02:00
TiA4f8R	6d27996ac4	Improve code of getStringResultFromRegexArray methods in Utils	2022-03-27 22:10:57 +02:00
TiA4f8R	2e3da445e6	[YouTube] Add documentation about parameters added and clients versions and key Also move the iPhone device machine id to a constant, explain how it is used and move the licence in the header of the file, and fix missing imports in YoutubeStreamExtractor (due to a rebase issue).	2022-03-27 22:10:57 +02:00
TiA4f8R	1dad3bfe8b	[YouTube] Update again hardcoded client versions and update mobile user agents Also provide ability to get mobile user-agents used for mobile InnerTube requests and deduplicate related code.	2022-03-27 20:52:40 +02:00
TiA4f8R	3d38459cf3	[YouTube] Reduce InnerTube response sizes by adding the prettyPrint parameter with the false value InnerTube responses return pretty printed responses, which increase responses' size for nothing. By using the prettyPrint parameter on requests and setting its value to false, responses are not pretty printed anymore, which reduces responses size, and so data transfer and processing times. This usage has been recently deployed by YouTube on their websites.	2022-03-27 20:52:40 +02:00
litetex	349ba8db7f	Improve tests and randomness - Use the existing RNG inside YoutubeParsingHelper - Deduplicated test-setup for YouTube tests - Minor improvements	2022-03-27 20:52:38 +02:00
TiA4f8R	52376949e5	Add setSeedForVideoTests method in YoutubeStreamExtractor tests In order to use still use mocks with the generation of random strings in player requests, we need to use YoutubeParsingHelper.setSeedForVideoTests() method in every stream test.	2022-03-27 20:51:39 +02:00
TiA4f8R	d0d91e6690	Adress requested changes	2022-03-27 20:51:39 +02:00
TiA4f8R	b6bc521f0d	[YouTube] Update client versions again	2022-03-27 20:51:38 +02:00
TiA4f8R	26f93f5bb0	[YouTube] Extract streams of livestreams from the iOS client and disabled the Android client for livestreams The iOS client is only enabled for livestreams and the Android client is now only enabled for videos, both by default. A way to force, or not, the fetch of both clients have been added with two new static methods in YoutubeStreamExtractor.	2022-03-27 20:51:38 +02:00
TiA4f8R	7d07924de8	[YouTube] Try to use lighter requests when extracting client version and key from YouTube and YouTube Music This is done by fetching https://www.youtube.com/sw.js for YouTube and https://music.youtube.com/sw.js for YouTube Music. Two new methods in Utils class have been added which allow to try to get a match of regular expressions in a string array, or a Pattern array, on a content, on a specific index or 0. Also some code refactoring has been made in this class.	2022-03-27 20:51:38 +02:00
TiA4f8R	05b7fee23b	[YouTube] Add the cpn param to playback requests and try to spoof better the Android client The cpn param, aka the content playback nonce param, is a parameter sent by YouTube web client in videoplayback requests, and for some of them, in the player request body. This PR adds it everywhere. For the desktop/WEB client, some params were missing from the playbackContext object, which seemed (or not) to make YouTube throttle streams extracted from the WEB client. This PR adds them. Fingerprinting on the WEB client basing on the client version used is not possible anymore, because the latest client version is extracted at the first time of a YouTube request on a session which require the extractor to fetch again the website (and this may come back the reCaptcha issues again unfortunately, but it seems there is no other way to get it). For the Android client, the video id is now also sent as a query parameter, like a 12 characters string, in the t query parameter, in order to spoof better this client. Researches need to be done on this parameter, unique to each request, and how it is generated by clients. This commit also fixes a small bug with the Android User-Agent string. Some code improvements have been also made.	2022-03-27 20:51:38 +02:00
TiA4f8R	83f374bff1	[YouTube] Update client versions and fix a bug when using resetClientVersionAndKey method The boolean keyAndVersionExtracted in YoutubeParsingHelper was not set to false when resetting the client version and the key, which makes the extractor uses null on the next getting of the client version or the key if the clientVersion and the key were extracted before. Also update client versions.	2022-03-27 20:51:38 +02:00
Stypox	8c1041def6	Add @ null annotations where Android Studio suggested it That is, basically where the overriding function was missing an annotation from the base method. Also apply renaming of emptyDescription to EMPTY_DESCRIPTION	2022-03-26 22:07:14 +01:00
Stypox	adbbdc7a5b	[YouTube] Fix regex warning: use ' {2}' instead of ' '	2022-03-26 22:07:14 +01:00
Stypox	24e83997b4	[Bandcamp] Add Java 8 streams	2022-03-26 22:07:12 +01:00
Stypox	349990fd48	Fix redundant escape \\ in regex in Utils	2022-03-26 22:01:30 +01:00
litetex	3bf7aa3762	Cleanup ``TimeAgoParser``	2022-03-26 21:09:31 +01:00
litetex	af82edf9dc	Fix checkstyle problems	2022-03-26 20:54:20 +01:00
litetex	29408f9356	``YoutubeMusicSearchExtractorTest$Suggestion`` has no mock data	2022-03-26 20:52:28 +01:00
litetex	fcee247f9f	Switched to mockonly tests * Made the reason why the tests are mockonly mandatory	2022-03-26 20:52:28 +01:00
litetex	0fceb4686b	Added missing mock data	2022-03-26 20:52:27 +01:00
litetex	70b20f0d6b	Updated mock data after conflicts	2022-03-26 20:52:27 +01:00
litetex	6a6c9359af	Fixed kurzgesagt test - They changed their description...	2022-03-26 20:52:27 +01:00
litetex	26596215fa	Refactored ``MediaCCCRecentListExtractorTest``	2022-03-26 20:52:26 +01:00
litetex	53962bfd7d	Disabled suggestion test for now Has to be done once YT's backed works again...	2022-03-26 20:52:26 +01:00
litetex	66dc5e8bb8	API hardening against changes	2022-03-26 20:52:26 +01:00
litetex	5d58156cde	Update mock data	2022-03-26 20:52:25 +01:00
litetex	7598b40957	Workaround for incorrect duration for "YT shorts" videos in channels As a workaround 0 is returned as duration for such videos. See also https://github.com/TeamNewPipe/NewPipe/issues/8034	2022-03-26 20:52:24 +01:00
litetex	164e21b5af	Fixed ``MediaCCCRecentKiosk`` Ignore faulty data/items (with duration <= 0)	2022-03-26 20:52:23 +01:00
litetex	49dfdae5d2	Removed useless throw declaration	2022-03-26 20:51:38 +01:00
litetex	69a58fd3d1	Don't use sysout	2022-03-26 20:51:37 +01:00
litetex	639be7adda	Minimized some code	2022-03-26 20:51:37 +01:00
litetex	2bd4299563	Fixed test: Peertube - Use new comments and videos Old comments/videos got deleted	2022-03-26 20:49:39 +01:00
litetex	606a386a98	Fix tests: What's peertube has less comments now	2022-03-26 20:49:39 +01:00
litetex	4d1a1c8fb8	Better test for ``MediaCCCRecentListExtractorTest`` * Use assertAll * Show which item is affected	2022-03-26 20:49:38 +01:00
litetex	ba43dbaa28	Fix Tests: The channel lost abos	2022-03-26 20:49:38 +01:00
litetex	ff436e5740	Fixed test: YT mix playlist	2022-03-26 20:49:38 +01:00
litetex	9c07e8a664	Fix useage of wrong object	2022-03-26 20:17:50 +01:00
litetex	804e57004f	Fixed new checkstyle problems from dev	2022-03-26 19:46:10 +01:00
litetex	33347ac18b	Removed unused methods ``contentFilters`` and ``sortfilter`` are get inside the ``ListLinkHandler`` and not the ``ListLinkHandlerFactory`` ``ListLinkHandlerFactory`` only passes these values through when ``fromQuery`` is called	2022-03-26 19:43:11 +01:00
litetex	ec5b54c38b	Removed unused class	2022-03-26 19:43:10 +01:00
litetex	8771af7ba5	Restored original naming	2022-03-26 19:43:09 +01:00
Stypox	bdadcfa1f7	Legitimately suppress remaining checkstyle warnings	2022-03-26 19:43:08 +01:00
Stypox	740a37a2de	[YouTube] Fix checkstyle issues	2022-03-26 19:42:40 +01:00
Stypox	9dc17cd1ca	[Soundcloud] Fix checkstyle issues	2022-03-26 19:40:20 +01:00
Stypox	9ab32cb2e7	[Peertube] Fix checkstyle issues	2022-03-26 19:40:19 +01:00
Stypox	9f7e06c817	[MediaCCC] Fix checkstyle issues	2022-03-26 19:40:18 +01:00
Stypox	3a94839359	[Bandcamp] Fix checkstyle issues	2022-03-26 19:40:17 +01:00
Stypox	08dff33002	Use Java 8 streams in NewPipe class	2022-03-26 19:40:15 +01:00
Stypox	c2446ecff0	Use Java 8 streams and deduplicate code in MediaFormat class	2022-03-26 19:40:15 +01:00
Stypox	d79e20340c	Fix checkstyle issues in root package extractor/ Note: not all issues were fixed because MediaFormat and ServiceList use a specific formatting that makes sense for them	2022-03-26 19:40:14 +01:00
Stypox	ca7c63f273	Fix remaining checkstyle issues in utils/ subpackage	2022-03-26 19:40:13 +01:00
Stypox	1d5f22e41f	Fix checkstyle issues & more in JsonUtils Also use Java 8 streams and extract duplicate code to getInstanceOf function	2022-03-26 19:40:13 +01:00
Stypox	87d2834986	Fix checkstyle issues & more in DonationLinkHelper Also add comment about the class being unused and replace the fixLink function with Utils.stringToUrl()	2022-03-26 19:40:12 +01:00
Stypox	bd7b362040	Fix checkstyle issues & more in DashMpdParser Also remove useless null check on ItagItem.getItag() as that function already throws an exception if there is no itag	2022-03-26 19:40:11 +01:00
Stypox	8aba2b47b0	Fix checkstyle issues in subpackages with abstract classes	2022-03-26 19:40:10 +01:00
Stypox	e4951a0623	Refactor code handling http headers in downloader.Request	2022-03-26 19:37:47 +01:00
Stypox	37690058d2	Add checkstyle to extractor gradle project With respect to NewPipe's checkstyle.xml, checkstyle is disabled for javadoc comments. There is no need for strict rules over comments here in the extractor, as sometimes javadocs are just needed to clarify a small thing and having empty/meaningless @param or @throws is useless.	2022-03-26 19:37:46 +01:00
litetex	9284569c84	Merge pull request #774 from TeamNewPipe/dependabot/gradle/org.mozilla-rhino-1.7.14 Bump rhino from 1.7.13 to 1.7.14	2022-03-26 17:25:55 +01:00
XiangRongLin	aa6b7272a4	Merge pull request #804 from Stypox/fix-yt-music-mix [YouTube] Fix music mixes in some countries	2022-03-20 08:35:56 +01:00
Stypox	09ddb6adbb	[YouTube] Add MockOnly to method testing mixes in related items	2022-03-19 10:54:38 +01:00
Stypox	8201b3b90e	[YouTube] Parse any playlist (including music mixes) in related items	2022-03-19 10:48:13 +01:00
Stypox	13f7900816	[YouTube] Add test for genre mix	2022-03-19 10:48:13 +01:00
Stypox	279f3a20fe	[YouTube] Fix mix tests with invalid video ids Replaces mix tests based on a strange mix type RDQM{videoId} (only reference I could find is https://github.com/ytdl-org/youtube-dl/issues/26228) and with an invalid video id of 13 characters (the first two characters were QM, but even after removing QM there still wasn't a video available at that id). Also updates mocks.	2022-03-19 10:48:13 +01:00
Stypox	d660c04838	[YouTube] Also test playlist type in playlist tests	2022-03-19 10:48:13 +01:00
Stypox	401082abe4	[YouTube] Extract playlist type in playlist extractor	2022-03-19 10:48:12 +01:00
Stypox	63ed06a710	[YouTube] Differentiate genre mixes from normal mixes Note: genre mixes already worked, now they are just considered as such in various video id extraction and in related items Note 2: now extracting a mix id from a normal youtube mix id will fail if the video id wouldn't be exactly 11 characters long	2022-03-19 10:46:31 +01:00
Stypox	f19660e7d9	[YouTube] Deduplicate code extracting video id from mix id	2022-03-19 10:46:30 +01:00
Stypox	8f9d5b858e	[YouTube] Remove useless comments about mixes	2022-03-19 10:44:06 +01:00
Stypox	34a4484c72	[YouTube] Add test for a video with a mix in related items	2022-03-19 10:44:06 +01:00
Stypox	50db871d89	[YouTube] Extract mixes from streams related items	2022-03-19 10:44:06 +01:00
Stypox	638da1756c	[Mix] Create MultiInfoItemsCollector It is a collector that can handle many extractor types, to be used when a list contains items of different types (e.g. search). It was renamed from InfoItemsSearchCollector so that it can now be used not just for search but for any extractor needing it. It supports, streams, channels, playlists and mixes.	2022-03-19 10:44:06 +01:00
Stypox	53673d64c6	[Mix] Add type to playlists & playlist items, to distinguish mixes	2022-03-19 10:44:06 +01:00
Stypox	d8f2031619	Merge pull request #816 from Stypox/mock-only-extension Add `@MockOnly` Junit 5 extension	2022-03-19 10:40:38 +01:00
litetex	cc2e4d7104	Merge pull request #815 from litetex/fix-soundcloud-id-once-and-for-all Removed hardcoded soundcloud HARDCODED_CLIENT_ID	2022-03-17 13:54:08 +01:00
TiA4f8R	c7757c0994	Apply requested changes	2022-03-16 20:14:08 +01:00
TiA4f8R	35e082248e	Fix YouTube and SoundCloud playlists tests	2022-03-16 19:40:30 +01:00
TiA4f8R	8b3f90eb7e	[YouTube] Fix extraction of series playlists and don't return the view count as the stream count for learning playlists ITEM_COUNT_UNKNOWN is returned when the JSON array which contains usally the number of videos is less than 3 items. Also apply the same type of optimizations done in other PlaylistExtractors in YoutubePlaylistExtractor.	2022-03-16 19:18:58 +01:00
TiA4f8R	58a247907e	Apply changes in all playlist extractors except YoutubePlaylistExtractor Also fix some issues in the extractors, remove uneeded overrides, use the Java 8 Stream API where possible and replace usages of Utils.UTF_8 with StandardCharsets.UTF_8 in these classes.	2022-03-16 19:18:57 +01:00
TiA4f8R	fc6b45ee36	Implement some methods in PlaylistExtractor This will prevent their override in each child class where the values corresponding to the methods could not be extracted.	2022-03-16 19:18:36 +01:00
Stypox	73d1fd472f	Add MockOnly junit 5 test extension	2022-03-16 19:03:08 +01:00
Stypox	ef71a5fa0f	static final instead of final static	2022-03-16 17:24:33 +01:00
Stypox	0c37c75981	Make getDownloader static & extract getDownloaderType	2022-03-16 17:22:42 +01:00
Stypox	40aa5104b1	Merge pull request #786 from XiangRongLin/throttling_resilience [Youtube] Make throttling decryption more resilient to api change	2022-03-16 11:03:16 +01:00
litetex	ba56be8ef1	Removed hardcoded soundcloud id It never works (long enough) so let's simply remove it...	2022-03-15 21:19:19 +01:00
XiangRongLin	e726437da3	Update extractor/src/main/java/org/schabi/newpipe/extractor/services/youtube/extractors/YoutubeStreamExtractor.java Co-authored-by: Stypox <stypox@pm.me>	2022-03-15 17:10:05 +01:00
litetex	e7aee0ca57	Merge pull request #807 from FireMasterK/no-commentsinfo-instance Remove the need for a CommentsInfo instance in CommentsInfo.getMoreItems and fix PeertubeCommentsExtractorTest.Default test	2022-03-15 15:06:56 +01:00
litetex	d806984aa8	Merge pull request #800 from TeamNewPipe/dependabot/gradle/com.google.code.gson-gson-2.9.0 Bump gson from 2.8.9 to 2.9.0	2022-03-14 19:47:54 +01:00
FireMasterK	60cc71e944	Remove the need for a CommentsInfo instance.	2022-03-03 11:48:41 +00:00

... 5 6 7 8 9 ...

2022 Commits