JCrawler Forum

  • About
  • JCrawler - 1.7 Beta Joomla 1.5
  • JCrawler forum

  • Home
  • JCrawler
  • JCrawler Forum

JCrawler Forum

Help forums for JCrawler

Register or log in (lost password?):

JCrawler Forum

» Problems

Jcrawler Not Crawling Site - error 403 - one link found

(4 posts)
  • Started 1 year ago by passmaster16
  • Latest reply from patrick
  • Related Topics:
    1. JCrawler Tips from trial and error experience
    2. Error when uninstalling jcrawler for upgrade
    3. Errors after I start the JCrawler
    4. Jcrawler Error 404 and 500 - How do you trace?
    5. Jcrawler giving 403 errors.

No tags yet.

  1. passmaster16
    Member

    Still having issues with Jcrawler 1.7. In fact Jcrawler has not worked for me since 1.4. I think it\\'s due to me using modrewrite. I posted the information from the last message I posted on here back in Oct 2008 when I upgraded from 1.4 to 1.5. Jcrawler has not worked for me since. I even tried the latest 1.7 version which you modified a few days ago to fix relative paths...still same thing :(

    Thanks

    Here is the current error I get with 1.7:

    * httpcode: 403 on url http://www.lebanonpc.org/

    Message

    * There are 1 links in your sitemap.
    * Success, wrote /home/lebanonp/public_html/Joomla_lpc/sitemap.xml
    * Success, wrote /home/lebanonp/public_html/Joomla_lpc//administrator/components/com_jcrawler/config.xml
    * total time: 0.0213 seconds

    ****************************************************************
    Here is my post from a while ago that you responded to.

    # Josh B. posted the following on 19. October 2008 at 21:00.

    Hi, thanks for this tool. I am using modrewrite to point requests to a subdirectory. This appears to confuse Jcrawler 1.5. When I go into the configuration, HTTP host is set to the actual subdirectory where the Joomla install resides. However, crawling fails because modrewrite is hiding that directory. The live_site variable is set to the URL without the subdirectory yet 1.5 must not check this. I did not have this problem with 1.4 beta. Did something change in 1.5 in how you determine the site’s URL?

    1. patrick posted the following on 21. October 2008 at 15:37.

    Hi Josh,

    yes, i changed the method, it takes the website from the php variable $_SERVER[\\'HTTP_HOST\\'],

    but i found a clear solution now, i’ll change it back in the next release. If you want to change it by yourself please write me an email.

    Greets Patrick

    PS: Sorry for the delayed answer.
    1. Josh. B. posted the following on 21. October 2008 at 17:12.

    Hi Patrick. Thanks for your reply. I’ve just reverted back to 1.4 beta for now. I’ll wait for your next release for the URL fix. Thanks again for this helpful component!Josh

    Posted 1 year ago #
  2. patrick
    Key Master

    hmm it seems to be an rewrite-error. is there a SEF component installed?

    i crawled your site and got

    There are 729 links in your sitemap.
    total time: 163.5603 seconds

    what is written in the \"HTTP host (readonly)\" field? did you modify your .htaccess?

    greets Patrick

    Posted 1 year ago #
  3. passmaster16
    Member

    Hi Patrick,

    My actual site is located at http://www.lebanonpc.org/Joomla_lpc but I use modrewrite to do 301 redirects to http://www.lebanonpc.org to eliminate confusion and make it easier for visitors to find us. I don\'t have any additional SEF components installed. I am using the built-in SEO options for Joomla. HTTP Host (readonly) has http://www.lebanonpc.org listed so maybe the modrewrite is still causing a problem for the crawler when installed locally? I did not modify the .htaccess in the Joomla directory but I do have a modified .htaccess in the root of my public directory to handle the modrewrite redirection.

    Thanks
    Josh

    Posted 1 year ago #
  4. patrick
    Key Master

    Hey Josh,

    can you send me an email of your rewrite\'s?
    or maybe you can try the latest release?:

    http://www.pixelschieber.ch/com_jcrawler_latestbuild.zip

    Greets Patrick

    Posted 1 year ago #

RSS feed for this topic

Reply

You must log in to post.

Pages

  • About
  • JCrawler - 1.7 Beta Joomla 1.5
  • JCrawler forum

Socialising

  • Facebook
  • Last fm

Webdesign

  • Cool Webdesigner

Recent Comments

  • patrick on Joomla 1.5 - SEO - Tipps und Tricks
  • Pablo on Circuit de Chenevières Jacques Cornu
  • ledzep on Joomla 1.5 - SEO - Tipps und Tricks
  • David on Joomla 1.5 - SEO - Tipps und Tricks
  • Sanakirja on Validation - Smile

Categories

Tags

Amerika anneau du rhin Audi Bridgestone cornu CYMK der standard Detail Druck Druckerei Erwartung Exportieren Farbkomponenten Film Frankreich Hammer Humor InDesign Ironman jacques Kinostreich Marvell maschine Modus Mopped PDF pdf standards Photoshop pixelschieber pixler Plan Racingtag reifen Rennstrecke Runde Schade Sponsorentasche Supersportler support masters Tiefsinn tipps Trailer Visitenkarte Vorlage Yamaha R1

Recent Posts

  • Circuit de Chenevières Jacques Cornu
  • TicTac Spot
  • Validation - Smile
  • James Bond 007: Quantum of Solace - Ein Quantum Trost - Filmkritik
  • Joomla 1.5 - SEO - Tipps und Tricks

Last referers

  • - http://oo(...)emap.html
  • - http://ub(...)online-35
  • - http://no(...)emap.html

Top Browsers

  • - IE 6
  • - Firefox 3
  • - IE 7
  • - IE 5
  • - Opera 9

Top OS

  • - WinXP
  • - WinVista
  • - Win2008
  • - Win2000
  • - MacOSX

Visitors Online

  • 01 visitor(s) online
  • powered by WassUp

JCrawler Forum is proudly powered by bbPress.