<?xml version="1.0" encoding="UTF-8"?><!-- generator="bbPress" -->

<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
>

<channel>
<title>JCrawler Forum Topic: Jcrawler Not Crawling Site - error 403 - one link found</title>
<link>http://www.pixelschieber.ch/forum/</link>
<description>Help forums for JCrawler</description>
<language>en</language>
<pubDate>Fri, 10 Sep 2010 20:10:31 +0000</pubDate>

<item>
<title>patrick on "Jcrawler Not Crawling Site - error 403 - one link found"</title>
<link>http://www.pixelschieber.ch/forum/topic/jcrawler-not-crawling-site-error-403-one-link-found#post-294</link>
<pubDate>Sun, 21 Jun 2009 23:35:36 +0000</pubDate>
<dc:creator>patrick</dc:creator>
<guid isPermaLink="false">294@http://www.pixelschieber.ch/forum/</guid>
<description>&#60;p&#62;Hey Josh,&#60;/p&#62;
&#60;p&#62;can you send me an email of your rewrite\&#38;#39;s?&#60;br /&#62;
or maybe you can try the latest release?:&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://www.pixelschieber.ch/com_jcrawler_latestbuild.zip&#34; rel=&#34;nofollow&#34;&#62;http://www.pixelschieber.ch/com_jcrawler_latestbuild.zip&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;Greets Patrick
&#60;/p&#62;</description>
</item>
<item>
<title>passmaster16 on "Jcrawler Not Crawling Site - error 403 - one link found"</title>
<link>http://www.pixelschieber.ch/forum/topic/jcrawler-not-crawling-site-error-403-one-link-found#post-286</link>
<pubDate>Sun, 14 Jun 2009 04:51:10 +0000</pubDate>
<dc:creator>passmaster16</dc:creator>
<guid isPermaLink="false">286@http://www.pixelschieber.ch/forum/</guid>
<description>&#60;p&#62;Hi Patrick,&#60;/p&#62;
&#60;p&#62;My actual site is located at &#60;a href=&#34;http://www.lebanonpc.org/Joomla_lpc&#34; rel=&#34;nofollow&#34;&#62;http://www.lebanonpc.org/Joomla_lpc&#60;/a&#62; but I use modrewrite to do 301 redirects to &#60;a href=&#34;http://www.lebanonpc.org&#34; rel=&#34;nofollow&#34;&#62;http://www.lebanonpc.org&#60;/a&#62; to eliminate confusion and make it easier for visitors to find us.  I don\&#38;#39;t have any additional SEF components installed.  I am using the built-in SEO options for Joomla.  HTTP Host (readonly) has &#60;a href=&#34;http://www.lebanonpc.org&#34; rel=&#34;nofollow&#34;&#62;http://www.lebanonpc.org&#60;/a&#62; listed so maybe the modrewrite is still causing a problem for the crawler when installed locally?  I did not modify the .htaccess in the Joomla directory but I do have a modified .htaccess in the root of my public directory to handle the modrewrite redirection.&#60;/p&#62;
&#60;p&#62;Thanks&#60;br /&#62;
Josh
&#60;/p&#62;</description>
</item>
<item>
<title>patrick on "Jcrawler Not Crawling Site - error 403 - one link found"</title>
<link>http://www.pixelschieber.ch/forum/topic/jcrawler-not-crawling-site-error-403-one-link-found#post-282</link>
<pubDate>Sat, 13 Jun 2009 18:26:58 +0000</pubDate>
<dc:creator>patrick</dc:creator>
<guid isPermaLink="false">282@http://www.pixelschieber.ch/forum/</guid>
<description>&#60;p&#62;hmm it seems to be an rewrite-error. is there a SEF component installed?&#60;/p&#62;
&#60;p&#62;i crawled your site and got &#60;/p&#62;
&#60;p&#62;There are 729 links in your sitemap.&#60;br /&#62;
total time: 163.5603 seconds&#60;/p&#62;
&#60;p&#62;what is written in the \&#38;quot;HTTP host (readonly)\&#38;quot; field? did you modify your .htaccess?&#60;/p&#62;
&#60;p&#62;greets Patrick
&#60;/p&#62;</description>
</item>
<item>
<title>passmaster16 on "Jcrawler Not Crawling Site - error 403 - one link found"</title>
<link>http://www.pixelschieber.ch/forum/topic/jcrawler-not-crawling-site-error-403-one-link-found#post-281</link>
<pubDate>Sat, 13 Jun 2009 06:06:53 +0000</pubDate>
<dc:creator>passmaster16</dc:creator>
<guid isPermaLink="false">281@http://www.pixelschieber.ch/forum/</guid>
<description>&#60;p&#62;Still having issues with Jcrawler 1.7.  In fact Jcrawler has not worked for me since 1.4.  I think it\\&#38;#39;s due to me using modrewrite.  I posted the information from the last message I posted on here back in Oct 2008 when I upgraded from 1.4 to 1.5.  Jcrawler has not worked for me since.  I even tried the latest 1.7 version which you modified a few days ago to fix relative paths...still same thing :(&#60;/p&#62;
&#60;p&#62;Thanks&#60;/p&#62;
&#60;p&#62;Here is the current error I get with 1.7:&#60;/p&#62;
&#60;p&#62;        * httpcode: 403 on url &#60;a href=&#34;http://www.lebanonpc.org/&#34; rel=&#34;nofollow&#34;&#62;http://www.lebanonpc.org/&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;Message&#60;/p&#62;
&#60;p&#62;        * There are 1 links in your sitemap.&#60;br /&#62;
        * Success, wrote /home/lebanonp/public_html/Joomla_lpc/sitemap.xml&#60;br /&#62;
        * Success, wrote /home/lebanonp/public_html/Joomla_lpc//administrator/components/com_jcrawler/config.xml&#60;br /&#62;
        * total time: 0.0213 seconds&#60;/p&#62;
&#60;p&#62;****************************************************************&#60;br /&#62;
Here is my post from a while ago that you responded to.&#60;/p&#62;
&#60;p&#62;#   Josh B. posted the following on 19. October 2008 at 21:00.&#60;/p&#62;
&#60;p&#62;Hi, thanks for this tool.  I am using modrewrite to point requests to a subdirectory.  This appears to confuse Jcrawler 1.5.  When I go into the configuration, HTTP host is set to the actual subdirectory where the Joomla install resides.  However, crawling fails because modrewrite is hiding that directory.  The live_site variable is set to the URL without the subdirectory yet 1.5 must not check this.  I did not have this problem with 1.4 beta.  Did something change in 1.5 in how you determine the site’s URL?&#60;/p&#62;
&#60;p&#62;   1. patrick posted the following on 21. October 2008 at 15:37.&#60;/p&#62;
&#60;p&#62;      Hi Josh,&#60;/p&#62;
&#60;p&#62;      yes, i changed the method, it takes the website from the php variable $_SERVER[\\&#38;#39;HTTP_HOST\\&#38;#39;],&#60;/p&#62;
&#60;p&#62;      but i found a clear solution now, i’ll change it back in the next release. If you want to change it by yourself please write me an email.&#60;/p&#62;
&#60;p&#62;      Greets Patrick&#60;/p&#62;
&#60;p&#62;      PS: Sorry for the delayed answer.&#60;br /&#62;
         1. Josh. B. posted the following on 21. October 2008 at 17:12.&#60;/p&#62;
&#60;p&#62;            Hi Patrick.  Thanks for your reply.  I’ve just reverted back to 1.4 beta for now.  I’ll wait for your next release for the URL fix.  Thanks again for this helpful component!Josh
&#60;/p&#62;</description>
</item>

</channel>
</rss>
