search engine tiddlywiki issue

57 views
Skip to first unread message

ldachary

unread,
Aug 27, 2007, 8:24:33 AM8/27/07
to TiddlyWiki
Hi,

I know this is a recuring question but I see it was not adressed in
2007 :-) How can I arrange for the tiddlywiki content to be visible to
search engines ? Currently all they get is something like : //\n\n",
MarkupPostHead: "", MarkupPreBody: "", MarkupPostBody: "",
TabTimeline: '<>', TabAll: '<>', TabTags: '<>', TabMoreMissing: '<>',
TabMoreOrphans: '<>', TabMoreShadowed: '<>', AdvancedOptions:
'<>', ...

Thanks in advance for your help

schilke

unread,
Aug 27, 2007, 9:19:28 AM8/27/07
to TiddlyWiki
> How can I arrange for the tiddlywiki content to be visible to
> search engines ?

this should be of some help: "SEO TiddlyWiki Plugin"
http://www.superphysique.net/#%5B%5BSEO%20TiddlyWiki%20Plugin%5D%5D

Plus - there are some related threads:
"Is it possible to list all tiddlers as a list of permalinks? -
TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/ed9772d8287f8a1c/
"google indexing - any luck? - TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/f972ed85646f8db3/
"TiddlyWiki and SEO - TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/a38f0db45c1fd1e8/
"SEO TiddlyWiki Plugin - TiddlyWiki | Google Groups"
http://groups.google.com/group/TiddlyWiki/browse_frm/thread/80378d6700e6f179/

HTH

--
schilke

Ken Girard

unread,
Aug 27, 2007, 10:02:27 AM8/27/07
to TiddlyWiki
OK, I guess this is something I am not getting.
I have a TW website that is getting 20-50 hits a day from Google and
Yahoo, and quite a few less from MSN and others. From this I take it
that the search engines know I am there, and what content I have
(Actually I think that is part of the problem. Having around 1000
tiddlers means that I get hits for every thing under the sun.
'Naughty' in one tiddler, 'Woman' in a second, 'Sex' in a third. They
all look alike to a bot.)

Or are you asking for a way to get search engines to understand that
'Naughty' is in the 'Stories about the kid', 'Woman' is in 'John
Lennon - Woman', and that 'Sex' is in 'Registration Form'?

I am thinking that would take getting Google and the others to have
the bot look for <![CDATA[
var version = {title: "TiddlyWiki", major: 2, minor: 2, revision: 4,
date: new Date("Jun 19, 2007"), extensions: {}};
//]]> (or some such thing. If such a designation doesn't exist maybe
it needs to be added to HTML 5 to cover all the wikis and such out
there. 'Format="TiddlyWiki"' maybe?) and then read the file
accordingly.

Ken Girard

schilke

unread,
Aug 27, 2007, 11:19:29 AM8/27/07
to TiddlyWiki
> OK, I guess this is something I am not getting

you're right - the search engine spiders don't understand any
javascript - if it's properly quoted according (x)HTML specs, the
javascript source shouldn't show up at all.

The other thing is, that the spiders will be able to get all your
content - but they must be served the proper links (in case of TW:
http://almostporn.net/#SiteMap) the domain or the start page won't
help, because the spiders will only get the DefaultTiddlers and no
links to follow - this will be an initial minus. Furthermore, if there
is no interesting content, that's the secon big minus - the site won't
be visited regularly and won't receive an acceptable ranking (if any).

The links to pages of your sites seem to be spread all over the net -
Google Groups, Forums etc. - therefore your pages are in the index -
it's as simple as that ;-)

--
schilke

schilke

unread,
Aug 27, 2007, 11:24:45 AM8/27/07
to TiddlyWiki
and slightly off-topic:

> http://almostporn.net/

http://no-sin.com/

LOL, Ken, if I will be out of ideas regarding domain names, I will ask
you...

--
schilke

Ken Girard

unread,
Aug 27, 2007, 1:21:43 PM8/27/07
to TiddlyWiki
The true secret to getting a page to be know and indexed: Post lots of
links to it. Do not post them in a fashion that makes the bots think
are link spam.
I 'promoted' a friend's brand new domain up to a page rank of 3 in
about a month by doing nothing but making my normal post in blogs all
across the net. He still had no real content (3 brief pages on how to
make beer), but now folks could find it.

Ken Girard

schilke

unread,
Aug 27, 2007, 3:31:18 PM8/27/07
to TiddlyWiki
> The true secret to getting a page to be know and indexed: Post lots of
> links to it

No secret at all - Google for example did mention that in several
publications: "link popularity" is most important for a good ranking -
what sounds absolutely reasonable: if a lot of sites link to yours,
there has to be good stuff on it (it's also not a secret, that Google
rates a bit different depending on the "linking sites" - if your site
is linked by, say mozilla.org or w3.org, Stanford or Harvard, that
would dramatically increase your page ranking ...).

When the number of abusers raised ("search engine professionals" - the
evil ones - if you know what I mean), Google tried to fix that with a
special link which told the spiders, not to follow those or at least
don't count them for page rank... (but they weren't able to make that
method popular enough so they use other methods now...)

--
schilke

ldachary

unread,
Aug 28, 2007, 8:10:19 AM8/28/07
to TiddlyWiki
Thanks for the links.

On Aug 27, 3:19 pm, schilke <googlegroups.tt.ch...@xoxy.net> wrote:
> this should be of some help: "SEO TiddlyWiki Plugin"http://www.superphysique.net/#%5B%5BSEO%20TiddlyWiki%20Plugin%5D%5D

I'm not too hot for a solution that means indexing a URL that is not
tiddlywiki. I can do that
by running "tiddlywiki_cp -a foo.html foo" and populate directory foo
with one file per tiddler.

Is there not a clever trick for non-javascript enabled browsers to see
all the content ?

schilke

unread,
Aug 28, 2007, 11:30:09 AM8/28/07
to TiddlyWiki
> Is there not a clever trick for non-javascript enabled browsers to see
> all the content ?

directly - no. Due to the fact that the content _is displayed by_
javascript - that means: no javascript - no content...

Err... that said it turns out, that my previous remarks on the tiddler
links - http://example.com/mytiddlywiki.html#ThisIsATiddlerLink - have
been nonsense...

Normally there should be an alternative left: XML - if you turn on the
feed generation this should help, because spiders also follow links in
xml files. But stop: it does help to generate a sitemap. Unfortunately
the search engine spiders will notice, that all the links in that file
lead to one single html file.
So you still would have to generate true html pages...

That's why the seo plugin should be a helpful thing: it creates the
html pages plus the simple javascript redirect.
The search engine spiders will read the html content while visitors
with js enabled browsers will be redirected to the proper tiddlers...

You can discover how the plugin works by viewing Fabrice's pages (by
disabling/enabling javascript of course):
http://www.google.com/search?source=ig&q=site%3Asuperphysique.net

BTW: although tiddlywiki.com should be a heavy load site with a ton of
backlinks, there are just around 40-50 pages indexed by Google.
Between 300 and 400 with tiddlyspot.com - but if you strip off the
blog (which is Wordpress - not TiddlyWiki) about 20 are left ;-)

HTH

--
schilke

Reply all
Reply to author
Forward
0 new messages