Reverse an earlier over-zealous check for a non-null value of the retrie... #119
1 changed files with 5 additions and 5 deletions
|
@ -18,7 +18,7 @@ connectivity.
|
||||||
</p>
|
</p>
|
||||||
<p>
|
<p>
|
||||||
However, setting up retriever can be quite tricky since it depends on
|
However, setting up retriever can be quite tricky since it depends on
|
||||||
the internal design of the website. This was designed to make life
|
the internal design of the website. That was designed to make life
|
||||||
easy for the website's developers, not for you. You'll need to have
|
easy for the website's developers, not for you. You'll need to have
|
||||||
some familiarity with HTML, and be willing to adapt when the website
|
some familiarity with HTML, and be willing to adapt when the website
|
||||||
suddenly changes everything without notice.
|
suddenly changes everything without notice.
|
||||||
|
@ -43,7 +43,7 @@ A simple case is when the article is wrapped in a "div" element:
|
||||||
</p>
|
</p>
|
||||||
<pre>
|
<pre>
|
||||||
...
|
...
|
||||||
<div class="main-content">
|
<div class="ArticleWrapper">
|
||||||
<h2>Man Bites Dog</h2>
|
<h2>Man Bites Dog</h2>
|
||||||
<img src="mbd.jpg">
|
<img src="mbd.jpg">
|
||||||
<p>
|
<p>
|
||||||
|
@ -58,7 +58,7 @@ A simple case is when the article is wrapped in a "div" element:
|
||||||
</pre>
|
</pre>
|
||||||
<p>
|
<p>
|
||||||
You then specify the tag "div", attribute "class", and value
|
You then specify the tag "div", attribute "class", and value
|
||||||
"main-content". Everything else in the page, such as navigation
|
"ArticleWrapper". Everything else in the page, such as navigation
|
||||||
panels and menus and footers and so on, will be discarded. If there
|
panels and menus and footers and so on, will be discarded. If there
|
||||||
is more than one section of the page you want to include, specify each
|
is more than one section of the page you want to include, specify each
|
||||||
one on a separate row. If the matching section contains some sections
|
one on a separate row. If the matching section contains some sections
|
||||||
|
@ -76,7 +76,7 @@ articles should be available.
|
||||||
<p>
|
<p>
|
||||||
You can leave the attribute and value blank to include all the
|
You can leave the attribute and value blank to include all the
|
||||||
corresponding elements with the specified tag name. You can also use
|
corresponding elements with the specified tag name. You can also use
|
||||||
a tag name of "*", which will match any element type with the
|
a tag name of just an asterisk ("*"), which will match any element type with the
|
||||||
specified attribute regardless of the tag.
|
specified attribute regardless of the tag.
|
||||||
</p>
|
</p>
|
||||||
<p>
|
<p>
|
||||||
|
@ -120,7 +120,7 @@ To change the URL used to retrieve the page, use the "URL Pattern" and
|
||||||
"URL Replace" fields. The pattern is a regular expression matching
|
"URL Replace" fields. The pattern is a regular expression matching
|
||||||
part of the URL to replace. In this case, you might use a pattern of
|
part of the URL to replace. In this case, you might use a pattern of
|
||||||
"/article" and a replace string of "/print/article". A common pattern
|
"/article" and a replace string of "/print/article". A common pattern
|
||||||
is simply "$", used to add the replace string to the end of the URL.
|
is simply a dollar sign ("$"), used to add the replace string to the end of the URL.
|
||||||
</p>
|
</p>
|
||||||
<h3>Background Processing</h3>
|
<h3>Background Processing</h3>
|
||||||
<p>
|
<p>
|
||||||
|
|
Loading…
Reference in a new issue