Deployed da41b6d with MkDocs version: 1.0.4

2019-07-02 13:24:37 +01:00 · 2019-07-02 13:24:37 +01:00 · bc52d25446
parent dbcbf2bad6
commit bc52d25446
59 changed files with 1794 additions and 1748 deletions
--- a/00_Prepare_everything/index.html
+++ b/00_Prepare_everything/index.html
@ -153,10 +153,9 @@
            <div class="section">
              
                <h1 id="before-you-start">Before You Start</h1>
-<p>These documents will guide you through the process of creating your own Extractor
-service of which will enable NewPipe to access additional streaming services, such as the currently supported YouTube and SoundCloud.
-The whole documentation consists of this page, which explains the general concept of the NewPipeExtractor, as well as our
-<a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/">Jdoc</a> setup.</p>
+<p>These documents will guide you through the process of understanding or creating your own Extractor
+service of which will enable NewPipe to access additional streaming services, such as the currently supported YouTube, SoundCloud and MediaCCC.
+The whole documentation consists of this page and <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/">Jdoc</a> setup, which explains the general concept of the NewPipeExtractor.</p>
 <p><strong>IMPORTANT!!!</strong> This is likely to be the worst documentation you have ever read, so do not hesitate to
 <a href="https://github.com/teamnewpipe/documentation/issues">report</a> if
 you find any spelling errors, incomplete parts or you simply don't understand something. We are an open community
--- a/01_Concept_of_the_extractor/index.html
+++ b/01_Concept_of_the_extractor/index.html
@ -75,7 +75,7 @@
        
            <li><a class="toctree-l3" href="#collectorextractor-pattern-for-lists">Collector/Extractor Pattern for Lists</a></li>
        
-            <li><a class="toctree-l3" href="#infoitems-encapsulated-in-pages">InfoItems Encapsulated in Pages</a></li>
+            <li><a class="toctree-l3" href="#listextractor">ListExtractor</a></li>
        
        </ul>
    
@ -196,15 +196,16 @@ try {
 <h2 id="collectorextractor-pattern-for-lists">Collector/Extractor Pattern for Lists</h2>
 <p>Information can be represented as a list. In NewPipe, a list is represented by a
 <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/InfoItemsCollector.html">InfoItemsCollector</a>.
-A InfoItemCollector will collect and assemble a list of <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/InfoItem.html">InfoItem</a>.
-For each item that should be extracted, a new Extractor must be created, and given to the InfoItemCollector via <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/InfoItemsCollector.html#commit-E-">commit()</a>.</p>
+A InfoItemsCollector will collect and assemble a list of <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/InfoItem.html">InfoItem</a>.
+For each item that should be extracted, a new Extractor must be created, and given to the InfoItemsCollector via <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/InfoItemsCollector.html#commit-E-">commit()</a>.</p>
 <p><img alt="InfoItemsCollector_objectdiagram.svg" src="../img/InfoItemsCollector_objectdiagram.svg" /></p>
-<p>If you are implementing a list for your service you need to extend InfoItem containing the extracted information
-and implement an <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/Extractor.html">InfoItemExtractor</a>,
-that will return the data of one InfoItem.</p>
+<p>If you are implementing a list in your service you need to implement an <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/Extractor.html">InfoItemExtractor</a>,
+that will be able to retreve data for one and only one InfoItem. This extractor will then be <em>comitted</em> to the <strong>InfoItemsCollector</strong> that can collect the type of InfoItems you want to generate.</p>
 <p>A common implementation would look like this:</p>
-<pre><code>private MyInfoItemCollector collectInfoItemsFromElement(Element e) {
-    MyInfoItemCollector collector = new MyInfoItemCollector(getServiceId());
+<pre><code>private SomeInfoItemCollector collectInfoItemsFromElement(Element e) {
+    // See *Some* as something like Stream or Channel
+    // e.g. StreamInfoItemsCollector, and ChannelInfoItemsCollector are provided by NP
+    SomeInfoItemCollector collector = new SomeInfoItemCollector(getServiceId());

    for(final Element li : element.children()) {
        collector.commit(new InfoItemExtractor() {
@ -225,15 +226,21 @@ that will return the data of one InfoItem.</p>

 </code></pre>

-<h2 id="infoitems-encapsulated-in-pages">InfoItems Encapsulated in Pages</h2>
+<h2 id="listextractor">ListExtractor</h2>
+<p>There is more to know about lists:</p>
+<ol>
+<li>
 <p>When a streaming site shows a list of items, it usually offers some additional information about that list like its title, a thumbnail,
 and its creator. Such info can be called <strong>list header</strong>.</p>
-<p>When a website shows a long list of items it usually does not load the whole list, but only a part of it. In order to get more items you may have to click on a next page button, or scroll down. </p>
-<p>This is why a list in NewPipe lists are chopped down into smaller lists called <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/ListExtractor.InfoItemsPage.html">InfoItemsPage</a>s. Each page has its own URL, and needs to be extracted separately.</p>
-<p>Additional metadata about the list and extracting multiple pages can be handled by a
-<a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/ListExtractor.html">ListExtractor</a>,
-and its <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/ListExtractor.InfoItemsPage.html">ListExtractor.InfoItemsPage</a>.</p>
-<p>For extracting list header information it behaves like a regular extractor. For handling <code>InfoItemsPages</code> it adds methods
+</li>
+<li>
+<p>When a website shows a long list of items it usually does not load the whole list, but only a part of it. In order to get more items you may have to click on a next page button, or scroll down.</p>
+</li>
+</ol>
+<p>Both of these Problems are fixed by the <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/ListExtractor.html">ListExtractor</a> which takes care about extracting additional metadata about the liast,
+and by chopping down lists into several pages, so called <a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/ListExtractor.InfoItemsPage.html">InfoItemsPage</a>s.
+Each page has its own URL, and needs to be extracted separately.</p>
+<p>For extracting list header information a <code>ListExtractor</code> behaves like a regular extractor. For handling <code>InfoItemsPages</code> it adds methods
 such as:</p>
 <ul>
 <li><a href="https://teamnewpipe.github.io/NewPipeExtractor/javadoc/org/schabi/newpipe/extractor/ListExtractor.html#getInitialPage--">getInitialPage()</a>
@ -245,6 +252,46 @@ such as:</p>
 </ul>
 <p>The reason why the first page is handled special is because many Websites such as YouTube will load the first page of
 items like a regular web page, but all the others as an AJAX request.</p>
+<p>An InfoItemsPage itself has two constructors which take these parameters:
+- The <strong>InfoitemsCollector</strong> of the list that the page should represent
+- A <strong>nextPageUrl</strong> which represents the url of the following page (may be null if not page follows).
+- Optionally <strong>errors</strong> which is a list of Exceptions that may have happened during extracton.</p>
+<p>Here is a simplified reference implementation of a list extractor that only extracts pages, but not metadata:</p>
+<pre><code>class MyListExtractor extends ListExtractor {
+    ...
+    private Document document;
+
+    ...
+
+    public InfoItemsPage&lt;SomeInfoItem&gt; getPage(pageUrl)
+        throws ExtractionException {
+        SomeInfoItemCollector collector = new SomeInfoItemCollector(getServiceId());
+        document = myFunctionToGetThePageHTMLWhatever(pageUrl);
+
+        //remember this part from the simple list extraction
+        for(final Element li : document.children()) {
+            collector.commit(new InfoItemExtractor() {
+                @Override
+                public String getName() throws ParsingException {
+                    ...
+                }
+
+                @Override
+                public String getUrl() throws ParsingException {
+                    ...
+                }
+                ...
+        }
+        return new InfoItemsPage&lt;SomeInfoItem&gt;(collector, myFunctionToGetTheNextPageUrl(document));
+    }
+
+    public InfoItemsPage&lt;SomeInfoItem&gt; getInitialPage() {
+        //document here got initialzied by the fetch() function.
+        return getPage(getTheCurrentPageUrl(document));
+    }
+    ... 
+}
+</code></pre>
              
            </div>
          </div>
--- a/02_Concept_of_LinkHandler/index.html
+++ b/02_Concept_of_LinkHandler/index.html
--- a/03_Implement_a_service/index.html
+++ b/03_Implement_a_service/index.html
--- a/04_Run_changes_in_App/index.html
+++ b/04_Run_changes_in_App/index.html
--- a/05_releasing/index.html
+++ b/05_releasing/index.html
--- a/06_documentation/index.html
+++ b/06_documentation/index.html
--- a/07_maintainers_view/index.html
+++ b/07_maintainers_view/index.html
--- a/404.html
+++ b/404.html
--- a/css/github.min.css
+++ b/css/github.min.css
--- a/css/highlight.css
+++ b/css/highlight.css
--- a/css/local_fonts.css
+++ b/css/local_fonts.css
--- a/css/theme.css
+++ b/css/theme.css
--- a/css/theme_child.css
+++ b/css/theme_child.css
--- a/css/theme_extra.css
+++ b/css/theme_extra.css
--- a/fonts/Inconsolata-Bold.ttf
+++ b/fonts/Inconsolata-Bold.ttf
--- a/fonts/Inconsolata-Regular.ttf
+++ b/fonts/Inconsolata-Regular.ttf
--- a/fonts/Lato-Bold.ttf
+++ b/fonts/Lato-Bold.ttf
--- a/fonts/Lato-BoldItalic.ttf
+++ b/fonts/Lato-BoldItalic.ttf
--- a/fonts/Lato-Italic.ttf
+++ b/fonts/Lato-Italic.ttf
--- a/fonts/Lato-Regular.ttf
+++ b/fonts/Lato-Regular.ttf
--- a/fonts/RobotoSlab-Bold.ttf
+++ b/fonts/RobotoSlab-Bold.ttf
--- a/fonts/RobotoSlab-Regular.ttf
+++ b/fonts/RobotoSlab-Regular.ttf
--- a/fonts/fontawesome-webfont.eot
+++ b/fonts/fontawesome-webfont.eot
--- a/fonts/fontawesome-webfont.svg
+++ b/fonts/fontawesome-webfont.svg
--- a/fonts/fontawesome-webfont.ttf
+++ b/fonts/fontawesome-webfont.ttf
--- a/fonts/fontawesome-webfont.woff
+++ b/fonts/fontawesome-webfont.woff
--- a/img/InfoItemsCollector_objectdiagram.svg
+++ b/img/InfoItemsCollector_objectdiagram.svg
--- a/img/check_path.png
+++ b/img/check_path.png
--- a/img/could_not_decrypt.png
+++ b/img/could_not_decrypt.png
--- a/img/draft_name.png
+++ b/img/draft_name.png
--- a/img/favicon.ico
+++ b/img/favicon.ico
--- a/img/feature_branch.svg
+++ b/img/feature_branch.svg
--- a/img/hotfix_branch.svg
+++ b/img/hotfix_branch.svg
--- a/img/jitpack_fail.png
+++ b/img/jitpack_fail.png
--- a/img/kde_in_a_nutshell.jpg
+++ b/img/kde_in_a_nutshell.jpg
--- a/img/merge_into_dev.svg
+++ b/img/merge_into_dev.svg
--- a/img/onedoes.jpg
+++ b/img/onedoes.jpg
--- a/img/prepare_tests_passed.png
+++ b/img/prepare_tests_passed.png
--- a/img/rebase_back_hotfix.svg
+++ b/img/rebase_back_hotfix.svg
--- a/img/rebase_back_release.svg
+++ b/img/rebase_back_release.svg
--- a/img/release_branch.svg
+++ b/img/release_branch.svg
--- a/img/select_gradle.png
+++ b/img/select_gradle.png
--- a/img/select_gradle_wrapper.png
+++ b/img/select_gradle_wrapper.png
--- a/img/sync_ok.png
+++ b/img/sync_ok.png
--- a/img/termux_files.png
+++ b/img/termux_files.png
--- a/index.html
+++ b/index.html
@ -198,5 +198,5 @@ It focuses on making it possible for the creator of a scraper for a streaming se

 <!--
 MkDocs version : 1.0.4
-Build Date UTC : 2019-04-07 17:32:18
+Build Date UTC : 2019-07-02 12:24:36
 -->
--- a/js/highlight.min.js
+++ b/js/highlight.min.js
--- a/js/jquery-2.1.1.min.js
+++ b/js/jquery-2.1.1.min.js
--- a/js/modernizr-2.8.3.min.js
+++ b/js/modernizr-2.8.3.min.js
--- a/js/theme.js
+++ b/js/theme.js
--- a/media/how_to_jitpack.mp4
+++ b/media/how_to_jitpack.mp4
--- a/search.html
+++ b/search.html
--- a/search/lunr.js
+++ b/search/lunr.js
--- a/search/main.js
+++ b/search/main.js
--- a/search/search_index.json
+++ b/search/search_index.json
--- a/search/worker.js
+++ b/search/worker.js
--- a/sitemap.xml
+++ b/sitemap.xml
@ -2,47 +2,47 @@
 <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
    <url>
     <loc>None</loc>
-     <lastmod>2019-04-07</lastmod>
+     <lastmod>2019-07-02</lastmod>
     <changefreq>daily</changefreq>
    </url>
 </urlset>
--- a/sitemap.xml.gz
+++ b/sitemap.xml.gz