We embedded an MP3 audio file as an intro to our index page; how does this affect crawl behaviour?

Magellan

Oct 15, 2009, 6:46:34 PM
to SOFTplus GSiteCrawler
My site's URL is: http://www.culturalgenetics.com/

We embedded an MP3 audio file as an intro on our index page, so that it
plays when the index page opens. How does this affect whether this page
and the whole site can be crawled successfully?

The code we had to add to the page's HTML is:

<EMBED src="DenizSarCulturalGeneticsCorporateGeneticsGroupIntro.mp3"
autostart=true loop=false width="0" height="0" true width="0" height="0"
border="0"><noembed>Microsoft Media Player Required</noembed>

We tested with Webmaster Tools, but we cannot figure out whether the
crawl robots can move past this code or get stuck right there.

For example, the A1 site crawler we use does not crawl beyond the index
page where it starts. Could the regular Google crawl robots be affected
in the same way? And how can we test this beyond any doubt?

Dear Google Webmaster Help Community, we sincerely thank you for any
contributions on this matter at your earliest convenience.

Best Regards,

Magellan 6:38 PM

webado

Oct 16, 2009, 1:13:20 AM
to SOFTplus GSiteCrawler
That <embed ...> tag, while invalid as far as the W3C validator is
concerned, does not affect crawling in any way. It will be totally
ignored. Robots don't play music in any case.
http://validator.w3.org/check?verbose=1&uri=http://www.culturalgenetics.com/
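One rough way to convince yourself of this is to run the markup through a tolerant HTML parser and see whether the links after the embed are still found. The sketch below uses Python's standard html.parser; the sample page, the shortened file name "Intro.mp3", and the helper names are made up for illustration, and this is not how Googlebot itself is implemented.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags, roughly as a simple crawler would."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <EMBED> arrives as "embed"
        # and is simply skipped here.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Return every <a href> found in the given HTML text."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical page: embed markup like the one in the question, then a link.
SAMPLE = (
    '<EMBED src="Intro.mp3" autostart=true loop=false width="0" '
    'height="0" true width="0" height="0" border="0">'
    "<noembed>Microsoft Media Player Required</noembed>"
    '<a href="DenizSarsHistoricBooks.htm">Books</a>'
)

print(extract_links(SAMPLE))
```

If the link after the embed shows up in the output, the invalid attributes did not stop the parser, which matches what you would expect from a search engine robot.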



There must be some other problem on your web page if you say robots
don't go beyond it.

Your robots.txt file disallows lots of things. You should check that
it doesn't disallow genuine pages you want indexed, or any pages needed
to reach them. There is too much in it for me to go through.
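A check like that can be scripted rather than done by eye. Here is a minimal sketch using Python's standard urllib.robotparser; the rules in SAMPLE_ROBOTS and the /forum/ and /private/ paths are invented for illustration and are not your site's actual robots.txt, so paste in your own file and URL list.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration; NOT the site's real robots.txt.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /forum/
Disallow: /private/
"""


def blocked_urls(robots_text, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network fetch is needed.
    parser.parse(robots_text.splitlines())
    return [u for u in urls if not parser.can_fetch(user_agent, u)]


pages = [
    "http://www.culturalgenetics.com/index.html",
    "http://www.culturalgenetics.com/forum/index.php",  # assumed path
]
for url in blocked_urls(SAMPLE_ROBOTS, pages):
    print("disallowed:", url)
```

Any page you want indexed that appears in the "disallowed" output, or that can only be reached through a disallowed page, points to a rule worth revisiting.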

Using Xenu's Link Sleuth (download it from
http://home.snafu.de/tilman/xenulink.html), we can see your site also
has a lot of broken links.

Just a few examples:

http://www.culturalgenetics.com/%A9%20Copyright,%20F.%20Deniz%20Sar,%20USA,%201980%20-%202096.%20All%20rights%20are%20reserved%20in%20USA%20and%20worldwide.
error code: 404 (not found), linked from page(s):
http://www.culturalgenetics.com/
http://www.culturalgenetics.com/index.html

http://www.culturalgenetics.com/Aray%FDn%20ve%20Deniz%20%DEar'%FDn%20Tarihi%20Kitaplar%FDn%FD%20%DEimdi%20Orjinal%20%DDmzal%FD%20%D6zel%20Bas%FDm%20Olarak%20Al%FDn
error code: 404 (not found), linked from page(s):
http://www.culturalgenetics.com/DenizSarsHistoricBooks.htm?__mk_ja_JP=%83J%83%5E%83J%83i&url=search-alias=aps&field-keywords=Principia&Go.x=9&Go.y=10
http://www.culturalgenetics.com/DenizSarsHistoricBooks.htm
http://www.culturalgenetics.com/DenizSarsBooksOfHistoricalMagnitude.htm
http://www.culturalgenetics.com/DenizSarTarihiKitaplar.htm
http://www.culturalgenetics.com/DenizSarTarihiKitaplarII.htm

http://www.culturalgenetics.com/Aray%FDn%20ve%20Deniz%20%DEar'%FDn%20Tarihi%20Kitaplar%FDn%FD%20%DEimdi%20Orjinal%20%DDmzal%FD%20%D6zel%20Bas%FDm%20Olarak%20Al%FDn:%20+1%20406%20466%204527%20%209%20am%20and%205%20pm%20US%20Eastern%20Time
error code: 404 (not found), linked from page(s):
http://www.culturalgenetics.com/DenizSarsHistoricBooks.htm?__mk_ja_JP=%83J%83%5E%83J%83i&url=search-alias=aps&field-keywords=Principia&Go.x=9&Go.y=10
http://www.culturalgenetics.com/DenizSarsHistoricBooks.htm
http://www.culturalgenetics.com/DenizSarsBooksOfHistoricalMagnitude.htm
http://www.culturalgenetics.com/DenizSarTarihiKitaplar.htm
http://www.culturalgenetics.com/DenizSarTarihiKitaplarII.htm




Your forum generates URLs with session IDs. That's a problem, because
there is an infinite number of them, yet they all point to the same
pages. Look into ways to suppress session IDs when robots visit.
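To illustrate why these URLs are duplicates, the sketch below strips a session parameter from the query string so that differently tagged URLs collapse to one address. The parameter names in SESSION_PARAMS and the example.com URLs are assumptions, since the forum software and its actual parameter are not named here. The real fix belongs in the forum's own configuration; this only shows the idea.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed session parameter names; the forum's real one is not stated.
SESSION_PARAMS = {"sid", "phpsessid", "sessionid"}


def strip_session_id(url):
    """Return the URL with any assumed session parameter removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in SESSION_PARAMS
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


# Two hypothetical forum URLs that differ only in the session ID.
a = strip_session_id("http://example.com/forum/viewtopic.php?t=5&sid=abc123")
b = strip_session_id("http://example.com/forum/viewtopic.php?t=5&sid=zzz999")
print(a == b)
```

Both URLs reduce to the same address, which is the single page a robot should be seeing.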