Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Faroese country names

8 views
Skip to first unread message

Canis

unread,
Dec 8, 2005, 6:20:43 PM12/8/05
to
Welcome all!
I'm searching names of this countries in faroese:
For all only long names.
*Czech Republic (not Czechia!)
*Russian Federation (not Russia!)
*Slovak Republic (not Slovakia!)

in lower sorbian and faroese:
*German Democratic Republic
*Union of Soviet Socialist Republics

only in lower sorbian:
*Federal Republic of Germany
*Former Yugoslav Republic of Macedonia

And in albanian, belarusian, faroese, lower sorbian & maltese
*Turkish Republic of Northern Cyprus

and link for country names in Upper Sorbian (hornjoserbsce)
Is anybody who's can help - please for informations.

Jukka K. Korpela

unread,
Dec 9, 2005, 2:57:35 AM12/9/05
to
In sci.lang under Subject: Re: Faroese country names
Canis wrote:

> I'm searching names of this countries in faroese:
> For all only long names.
> *Czech Republic (not Czechia!)
> *Russian Federation (not Russia!)
> *Slovak Republic (not Slovakia!)

Are you sure you want the _long_ names? They are supposed to be used
only in official documents when referring to states as political
entities, actors in international relationships.

Anyway, the short, common names in Faroese of the Czech Republic and
Russia are "Kekkia" and "Russland", according to
http://unicode.org/cldr/data/common/main/en.xml
(If you find an error in that data, please submit a bug report, see
http://unicode.org/cldr/ )

> in lower sorbian and faroese:
> *German Democratic Republic
> *Union of Soviet Socialist Republics
>
> only in lower sorbian:
> *Federal Republic of Germany
> *Former Yugoslav Republic of Macedonia
>
> And in albanian, belarusian, faroese, lower sorbian & maltese
> *Turkish Republic of Northern Cyprus

You seem to have a rather mixed list here, containing historical
countries, a "politically correct" awkward name for an existing country
(Macedonia), and a name for an entity that has not been recognized as a
country.

> and link for country names in Upper Sorbian (hornjoserbsce)

That might be difficult to find.

You might find some other names you're looking for at
http://www.geonames.de/
(which is a personal creation, so the information has not been reviewed
in any official way).

I've fixed the Subject line (the original one was misleading, since it
was too restrictive) and set followups to sci.lang.translation, which is
more appropriate here than the general sci.lang group.

Martin Möller

unread,
Dec 9, 2005, 7:20:03 AM12/9/05
to
Hello,

guessing from the dictionary

http://www.dolnoserbski.de/dnw/index.htm

Federal Republic of Germany should be something like
Zwezkowa republika Nimska

in lower Sorbian...

(The e in Zwezkowa has a upside down accent circonflex (no idea what the
technical term is). I mention this as the accent is probably spoiled by the
ascii format of the usenet.)

Perhaps you should ask that institute
http://www.dolnoserbski.de/dnw/index.htm directly for the country names you
need!

Regards
Martin

"Canis" <fd...@poczta.onet.pl> schrieb im Newsbeitrag
news:1134084042....@g43g2000cwa.googlegroups.com...

Peter T. Daniels

unread,
Dec 9, 2005, 8:56:15 AM12/9/05
to
Martin Möller wrote:
>
> Hello,
>
> guessing from the dictionary
>
> http://www.dolnoserbski.de/dnw/index.htm
>
> Federal Republic of Germany should be something like
> Zwezkowa republika Nimska
>
> in lower Sorbian...
>
> (The e in Zwezkowa has a upside down accent circonflex (no idea what the
> technical term is). I mention this as the accent is probably spoiled by the
> ascii format of the usenet.)

hachek (Unicode calls it "caron," I know not why, and phoneticians often
call it "wedge")
--
Peter T. Daniels gram...@att.net

Paul J Kriha

unread,
Dec 10, 2005, 4:43:45 AM12/10/05
to

Peter T. Daniels <gram...@worldnet.att.net> wrote in message
news:43998D...@worldnet.att.net...

Is that how it's commonly spelled? Hachek?

The CED spells it hacek, but (for crying out loud) it goes
a step further. It spells it with the (correct) diacritics.
There is a hacek above "c" in "hacek" :-) to mark the palatal
"c" and "carka" above "a" to mark a long vowel "a".

I suppose ASCII "hachek" results in the closest approximation
of the original CZ pronunciation without going into diacritics
or IPA.

pjk

> Peter T. Daniels


zbihniew

unread,
Dec 10, 2005, 12:15:51 PM12/10/05
to
Użytkownik Peter T. Daniels napisał:

> hachek (Unicode calls it "caron," I know not why, and phoneticians often
> call it "wedge")

"hachek" means "a little hook" and it's correctly spelt as "hacek" with
a hacek over "c".

Des Small

unread,
Dec 10, 2005, 12:18:23 PM12/10/05
to
zbihniew <zbih...@ask.me> writes:

viz. "haček".

Des
is curious what encoding that will come out in


--

Thomas Widmann

unread,
Dec 10, 2005, 12:44:31 PM12/10/05
to
Des Small <vonb...@yahoo.co.uk> writes:

> zbihniew <zbih...@ask.me> writes:
>
>> Użytkownik Peter T. Daniels napisał:
>>
>> > hachek (Unicode calls it "caron," I know not why, and phoneticians often
>> > call it "wedge")
>>
>> "hachek" means "a little hook" and it's correctly spelt as "hacek"
>> with a hacek over "c".
>
> viz. "haček".

But it should be "háček", methinks.

> is curious what encoding that will come out in

UTF-8.

/Thomas
--
Thomas Widmann tw...@bibulus.org http://www.twid.bibulus.org
Flat 0/1, 57 Rose Street, Garnethill, Glasgow G3 6SF, Scotland, EU

Brian M. Scott

unread,
Dec 10, 2005, 12:47:40 PM12/10/05
to
On Sat, 10 Dec 2005 17:44:31 +0000, Thomas Widmann
<tw...@bibulus.org> wrote in
<news:m38xusx...@alcedo.bibulus.org> in sci.lang:

> Des Small <vonb...@yahoo.co.uk> writes:

[...]

>> viz. "haček".

> But it should be "háček", methinks.

>> is curious what encoding that will come out in

> UTF-8.

And so was yours. But I'll bet that mine makes it
ISO-8859-2.

Brian

Thomas Widmann

unread,
Dec 10, 2005, 1:03:38 PM12/10/05
to

Absolutely correct!

zbihniew

unread,
Dec 10, 2005, 2:59:15 PM12/10/05
to
Użytkownik Thomas Widmann napisał:

>
> But it should be "háček", methinks.

I agree. Sorry. Forgot about that accent.

zbihniew

unread,
Dec 10, 2005, 3:06:22 PM12/10/05
to

> viz. "haček".
>
> Des
> is curious what encoding that will come out in

Unicode (eg. UTF-8) and Eastern European encodings (standard ISO-8859-2,
microsoftian Windows-1250 and DOS CodePage 852).

Paul J Kriha

unread,
Dec 11, 2005, 1:47:48 AM12/11/05
to

Des Small <vonb...@yahoo.co.uk> wrote in message news:yyrj64pw...@pc156.maths.bris.ac.uk...

It arrived to my abode with a header param charset=UTF-8,
i.e. 8 bit unicode. It looks perfectly fine on my screen.


Just one little niggle though....

Zbihniew's "haček" is spelled incorrectly.
A correct original spelling is "háček" with a long-vowel
marker (called čárka) above the first "a".

I suggest, in English texts it should be spelled either
"hacek" or "háček". If the Czech diacritics are to be
included then they all should be present. They are all
phonemicly significant. Spelling "háček" with a wrong
letter as "haček" turns it into gibberish.

If somebody goes into trouble of figuring out how to
include a "č" in his handywork then surely producing
an "á" (an even more commonly available letter)
should be relatively a piece of cake.

pjk
is happy not having to mention diacritics of a long "u"


Peter T. Daniels

unread,
Dec 11, 2005, 10:48:54 AM12/11/05
to
Thomas Widmann wrote:
>
> "Brian M. Scott" <b.s...@csuohio.edu> writes:
>
> > On Sat, 10 Dec 2005 17:44:31 +0000, Thomas Widmann
> > <tw...@bibulus.org> wrote in
> > <news:m38xusx...@alcedo.bibulus.org> in sci.lang:
> >
> >> Des Small <vonb...@yahoo.co.uk> writes:
> >
> > [...]
> >
> >>> viz. "haÄ*ek".
> >
> >> But it should be "háÄ*ek", methinks.

> >
> >>> is curious what encoding that will come out in
> >
> >> UTF-8.
> >
> > And so was yours. But I'll bet that mine makes it
> > ISO-8859-2.
>
> Absolutely correct!

And now everyone understands why <hachek> is preferable.

No guesses about "caron"?

Jukka K. Korpela

unread,
Dec 11, 2005, 11:18:51 AM12/11/05
to
Under Subject: Re: Faroese country names
"Peter T. Daniels" <gram...@worldnet.att.net> wrote:

> No guesses about "caron"?

Presumably you mean the origin (etymology) of the word. The name "caron" is
used in character standards for some odd reason, and the origin is unknown:

"Whoever the originator is, we suspect that he has probably taken his secret
to the grave by now."
http://www.unicode.org/faq/casemap_charprop.html#14

--
Yucca, http://www.cs.tut.fi/~jkorpela/

Brian M. Scott

unread,
Dec 11, 2005, 11:55:54 AM12/11/05
to
On Sun, 11 Dec 2005 15:48:54 GMT, "Peter T. Daniels"
<gram...@worldnet.att.net> wrote in
<news:439C4A...@worldnet.att.net> in sci.lang:

> Thomas Widmann wrote:

>>>> Des Small <vonb...@yahoo.co.uk> writes:

>>> [...]

>>>>> viz. "haÄ*ek".

>>>> UTF-8.

>> Absolutely correct!

Well, no: whether it was encoded as UTF-8 or as ISO-8859-2,
competent newsreaders on two different operating systems
seem to have interpreted it correctly. Be damned if I can
tell what yours did to it, though it looks quite delightful
when viewed on the assumption that it's Mac Thai!

> No guesses about "caron"?

Brian

Peter T. Daniels

unread,
Dec 11, 2005, 11:57:24 AM12/11/05
to

Typical Unicode "scholarship." Adopt (or invent) a term with no
comprehension at all.

benlizross

unread,
Dec 11, 2005, 1:39:39 PM12/11/05
to

Bastard offspring of "caret" and "macron"??

Ross Clark

Jukka K. Korpela

unread,
Dec 11, 2005, 3:32:59 PM12/11/05
to
Peter T. Daniels wrote:

>>"Whoever the originator is, we suspect that he has probably taken his secret
>>to the grave by now."
>>http://www.unicode.org/faq/casemap_charprop.html#14
>
> Typical Unicode "scholarship." Adopt (or invent) a term with no
> comprehension at all.

Actually, if you check the Unicode FAQ entry I referred to, you'll see
that the name "caron" was invented in the 1980s, i.e. before the Unicode
Consortium was even founded. As the entry says, "Unicode and ISO 8859-x
just carried the tradition along".

Peter T. Daniels

unread,
Dec 11, 2005, 6:01:45 PM12/11/05
to

The parenthesized words were in parentheses for a reason.

Hachek was a real word, in a real language, with a real referent. Why
not use it?

Joe Fineman

unread,
Dec 11, 2005, 8:04:19 PM12/11/05
to
"Jukka K. Korpela" <jkor...@cs.tut.fi> writes:

> http://www.unicode.org/faq/casemap_charprop.html#14

I was astonished to see "clicka" mentioned as yet another name. I
used to call the mark "klicka" (with a hacek over the c), thinking
that that was the Czech name. When I found out about "hacek", I
concluded that I must have invented "klicka" in a hazy moment.
Perhaps not.
--
--- Joe Fineman jo...@verizon.net

||: Anger is ice for the toothache of shame. :||

Paul J Kriha

unread,
Dec 12, 2005, 12:13:00 AM12/12/05
to

Peter T. Daniels <gram...@worldnet.att.net> wrote in message
news:439C4A...@worldnet.att.net...

You don't need to change the word's spelling to such a degree.
Just dropping the diacritics solves the above problems.
So, <hacek> <carka> <krouzek> won't get mucked up.

pjk

Paul J Kriha

unread,
Dec 12, 2005, 1:22:50 AM12/12/05
to

Joe Fineman <jo...@verizon.net> wrote in message news:u64pv3...@verizon.net...


"klic^ka" is a diminutive of "klika" (a handle).
If I came across the word "klic^ka" in the context of
handwriting or printing I would assume it meant some
kind of wavy, squiggly, little-handle, z-like shape.
Without any other clues I'd guess it referred to a tilde.

The hare running away from danger "klic^kuje" (3p. sng. masc).

The diminutive noun "klic^ka" is also used to refer to
the shape formed by the tied shoelaces.

I haven't come across "klic^ka" used in place of "ha'c^ek".
If it was used that way by a Czech, I'd say it was a part
of a localized temporary student jargon. Something like
"zavina'c^" (roll mop) referring to an ampersand.

pjk


Peter T. Daniels

unread,
Dec 12, 2005, 9:20:55 AM12/12/05
to
Paul J Kriha wrote:

> > And now everyone understands why <hachek> is preferable.
>
> You don't need to change the word's spelling to such a degree.
> Just dropping the diacritics solves the above problems.
> So, <hacek> <carka> <krouzek> won't get mucked up.

<hacek> without a hachek wouldn't be pronounced ['hatSek]. You'd get
['hasEk], ['kark@], and ['kruwzEk], which probably aren't right.

0 new messages