This post means to start a discussion on SEF Sitemap.
Since I dicoverd this tool I used it on all my Joomla sites because of the strange results using non-Joomla oriented tools.
As a novice user I also used the default \"Forbidden file types\".
I recent discussion on SE handling of duplicate content and an examination of the generated JCrawler sitemap.xml I noticed that for every html page that contains a the typical PDF icon to produce a document, I find both a html and a pdf URL in the sitemap.xml and both have - of course - the same content.
Chances are Google will recognse this as duplicate content and will subsequently lower the pagerank.
So I strongly believe that these kind of pagelinks should not be followed or the .pdf should be in the default \"Forbidden file types\" in the next revision.
If anybody has more tips for dos and don\'ts I would really like the read and or discuss them.