Reverse an earlier over-zealous check for a non-null value of the retrie... #119

Merged
mexon merged 3 commits from retriever into master 2013-05-06 07:08:52 +02:00
Showing only changes of commit c7878a0f78 - Show all commits

View file

@ -18,7 +18,7 @@ connectivity.
</p> </p>
<p> <p>
However, setting up retriever can be quite tricky since it depends on However, setting up retriever can be quite tricky since it depends on
the internal design of the website. This was designed to make life the internal design of the website. That was designed to make life
easy for the website's developers, not for you. You'll need to have easy for the website's developers, not for you. You'll need to have
some familiarity with HTML, and be willing to adapt when the website some familiarity with HTML, and be willing to adapt when the website
suddenly changes everything without notice. suddenly changes everything without notice.
@ -43,7 +43,7 @@ A simple case is when the article is wrapped in a "div" element:
</p> </p>
<pre> <pre>
... ...
&lt;div class="main-content"&gt; &lt;div class="ArticleWrapper"&gt;
&lt;h2&gt;Man Bites Dog&lt;/h2&gt; &lt;h2&gt;Man Bites Dog&lt;/h2&gt;
&lt;img src="mbd.jpg"&gt; &lt;img src="mbd.jpg"&gt;
&lt;p&gt; &lt;p&gt;
@ -58,7 +58,7 @@ A simple case is when the article is wrapped in a "div" element:
</pre> </pre>
<p> <p>
You then specify the tag "div", attribute "class", and value You then specify the tag "div", attribute "class", and value
"main-content". Everything else in the page, such as navigation "ArticleWrapper". Everything else in the page, such as navigation
panels and menus and footers and so on, will be discarded. If there panels and menus and footers and so on, will be discarded. If there
is more than one section of the page you want to include, specify each is more than one section of the page you want to include, specify each
one on a separate row. If the matching section contains some sections one on a separate row. If the matching section contains some sections
@ -76,7 +76,7 @@ articles should be available.
<p> <p>
You can leave the attribute and value blank to include all the You can leave the attribute and value blank to include all the
corresponding elements with the specified tag name. You can also use corresponding elements with the specified tag name. You can also use
a tag name of "*", which will match any element type with the a tag name of just an asterisk ("*"), which will match any element type with the
specified attribute regardless of the tag. specified attribute regardless of the tag.
</p> </p>
<p> <p>
@ -120,7 +120,7 @@ To change the URL used to retrieve the page, use the "URL Pattern" and
"URL Replace" fields. The pattern is a regular expression matching "URL Replace" fields. The pattern is a regular expression matching
part of the URL to replace. In this case, you might use a pattern of part of the URL to replace. In this case, you might use a pattern of
"/article" and a replace string of "/print/article". A common pattern "/article" and a replace string of "/print/article". A common pattern
is simply "$", used to add the replace string to the end of the URL. is simply a dollar sign ("$"), used to add the replace string to the end of the URL.
</p> </p>
<h3>Background Processing</h3> <h3>Background Processing</h3>
<p> <p>