Douplicates in table 'wp'

6 views
Skip to first unread message

Dashamir Hoxha

unread,
Oct 20, 2011, 11:56:42 AM10/20/11
to open...@googlegroups.com
Hello,

I would like to create a drupal module which works
like a dictionary of previous translations, just like
http://open-tran.eu/

However I would like to extend it so that people can also vote
which translation is better, and even suggest new translations
that they think are suitable. This voting input (feedback) will
be collected using social networks (like facebook, G+, twitter,
etc.) or by a simple email interface.

It should be possible also to append new (untranslated) strings
to the database, and after some time we can have some translation
suggestions on these strings, and the corresponding votes (or Like-s)
on each of them.

I have created a sandbox project at drupal.org:
http://drupal.org/sandbox/dashohoxha/1305466
http://drupal.org/project/1305466/git-instructions
http://drupal.org/node/1305466/commits

If you are interested, you can follow the project development by
subscribing to this RSS:
http://drupalcode.org/sandbox/dashohoxha/1305466.git/rss

Actually I don't expect anybody to help with coding, however
since you have experience with these problems maybe you can help
with any advice or suggestions.

I have already downloaded the data of open-tran.eu and I have
played around a little bit with them. And I have found something
that either I don't understand properly or it is an error.

My understanding is that (phraseid,wordid) on the table 'wp' is
a primary key. If a word occurs more than once in a phrase, then
the 'count' field shows it. However I have found duplicates of
(phraseid,wordid) in this table. These are listed in the files
attached.

Can you help me, is this an error, or I am missing something?

Thanks,
Dashamir

--
Dashamir Hoxha
GPG: 4F97 4EDE C739 4C3D 361E 16D3 FD06 AA8E 55D5 9B28
In God we trust. Everybody else we verify using GnuPG!

wp_doublicates.csv
wp_doublicates.ods
signature.asc

Dashamir Hoxha

unread,
Oct 20, 2011, 6:16:24 PM10/20/11
to open...@googlegroups.com
On 10/20/2011 05:56 PM, Dashamir Hoxha wrote:

> My understanding is that (phraseid,wordid) on the table 'wp' is
> a primary key. If a word occurs more than once in a phrase, then
> the 'count' field shows it. However I have found duplicates of
> (phraseid,wordid) in this table. These are listed in the files
> attached.
>
> Can you help me, is this an error, or I am missing something?

I just remembered that I have initially found duplicates in
phrases and I have tried to unify them in one, adjusting the
relationships as well. So, this may be the cause of the duplicates
in 'wp'.

The duplicates in phrases probably are created because I trim
the phrases. It also depends on the way of comparison used.

Regards,

signature.asc

Jacek Śliwerski

unread,
Oct 24, 2011, 2:29:56 PM10/24/11
to open...@googlegroups.com, dasho...@gmail.com
Your understanding of the design is correct. If there are duplicates
in the wp table, then it is a mistake. But it would take me a while to
determine the cause of this issue and I haven't had enough time
recently.

Jacek

Reply all
Reply to author
Forward
0 new messages