I know this is a recuring question but I see it was not adressed in
2007 :-) How can I arrange for the tiddlywiki content to be visible to
search engines ? Currently all they get is something like : //\n\n",
MarkupPostHead: "", MarkupPreBody: "", MarkupPostBody: "",
TabTimeline: '<>', TabAll: '<>', TabTags: '<>', TabMoreMissing: '<>',
TabMoreOrphans: '<>', TabMoreShadowed: '<>', AdvancedOptions:
'<>', ...
Thanks in advance for your help
this should be of some help: "SEO TiddlyWiki Plugin"
http://www.superphysique.net/#%5B%5BSEO%20TiddlyWiki%20Plugin%5D%5D
Plus - there are some related threads:
"Is it possible to list all tiddlers as a list of permalinks? -
TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/ed9772d8287f8a1c/
"google indexing - any luck? - TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/f972ed85646f8db3/
"TiddlyWiki and SEO - TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/a38f0db45c1fd1e8/
"SEO TiddlyWiki Plugin - TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/80378d6700e6f179/
HTH
--
schilke
Or are you asking for a way to get search engines to understand that
'Naughty' is in the 'Stories about the kid', 'Woman' is in 'John
Lennon - Woman', and that 'Sex' is in 'Registration Form'?
I am thinking that would take getting Google and the others to have
the bot look for <![CDATA[
var version = {title: "TiddlyWiki", major: 2, minor: 2, revision: 4,
date: new Date("Jun 19, 2007"), extensions: {}};
//]]> (or some such thing. If such a designation doesn't exist maybe
it needs to be added to HTML 5 to cover all the wikis and such out
there. 'Format="TiddlyWiki"' maybe?) and then read the file
accordingly.
Ken Girard
you're right - the search engine spiders don't understand any
javascript - if it's properly quoted according (x)HTML specs, the
javascript source shouldn't show up at all.
The other thing is, that the spiders will be able to get all your
content - but they must be served the proper links (in case of TW:
http://almostporn.net/#SiteMap) the domain or the start page won't
help, because the spiders will only get the DefaultTiddlers and no
links to follow - this will be an initial minus. Furthermore, if there
is no interesting content, that's the secon big minus - the site won't
be visited regularly and won't receive an acceptable ranking (if any).
The links to pages of your sites seem to be spread all over the net -
Google Groups, Forums etc. - therefore your pages are in the index -
it's as simple as that ;-)
--
schilke
LOL, Ken, if I will be out of ideas regarding domain names, I will ask
you...
--
schilke
Ken Girard
No secret at all - Google for example did mention that in several
publications: "link popularity" is most important for a good ranking -
what sounds absolutely reasonable: if a lot of sites link to yours,
there has to be good stuff on it (it's also not a secret, that Google
rates a bit different depending on the "linking sites" - if your site
is linked by, say mozilla.org or w3.org, Stanford or Harvard, that
would dramatically increase your page ranking ...).
When the number of abusers raised ("search engine professionals" - the
evil ones - if you know what I mean), Google tried to fix that with a
special link which told the spiders, not to follow those or at least
don't count them for page rank... (but they weren't able to make that
method popular enough so they use other methods now...)
--
schilke
On Aug 27, 3:19 pm, schilke <googlegroups.tt.ch...@xoxy.net> wrote:
> this should be of some help: "SEO TiddlyWiki Plugin"http://www.superphysique.net/#%5B%5BSEO%20TiddlyWiki%20Plugin%5D%5D
I'm not too hot for a solution that means indexing a URL that is not
tiddlywiki. I can do that
by running "tiddlywiki_cp -a foo.html foo" and populate directory foo
with one file per tiddler.
Is there not a clever trick for non-javascript enabled browsers to see
all the content ?
directly - no. Due to the fact that the content _is displayed by_
javascript - that means: no javascript - no content...
Err... that said it turns out, that my previous remarks on the tiddler
links - http://example.com/mytiddlywiki.html#ThisIsATiddlerLink - have
been nonsense...
Normally there should be an alternative left: XML - if you turn on the
feed generation this should help, because spiders also follow links in
xml files. But stop: it does help to generate a sitemap. Unfortunately
the search engine spiders will notice, that all the links in that file
lead to one single html file.
So you still would have to generate true html pages...
That's why the seo plugin should be a helpful thing: it creates the
html pages plus the simple javascript redirect.
The search engine spiders will read the html content while visitors
with js enabled browsers will be redirected to the proper tiddlers...
You can discover how the plugin works by viewing Fabrice's pages (by
disabling/enabling javascript of course):
http://www.google.com/search?source=ig&q=site%3Asuperphysique.net
BTW: although tiddlywiki.com should be a heavy load site with a ton of
backlinks, there are just around 40-50 pages indexed by Google.
Between 300 and 400 with tiddlyspot.com - but if you strip off the
blog (which is Wordpress - not TiddlyWiki) about 20 are left ;-)
HTH
--
schilke