Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Russian Language Word Processing

49 views
Skip to first unread message

Ajit Dingankar

unread,
Apr 4, 1991, 4:02:59 PM4/4/91
to
Greetings!

I was wondering if anyone out there in the NetLand uses Russian
word processors. Any info about PC packages / LaTeX fonts would be
appreciated (how does TeX handle the extended alphabet, if the PC needs
to have a special configuration for the keyboard etc).

Pazhaloosta,
Ajit


Asya Kamsky

unread,
Apr 4, 1991, 6:46:58 PM4/4/91
to

I'm translating a book into Russian on my PC and I settled on a
package called MULTIWRITE. It comes with Cerylic as well as
Greek, Arabic, Hebrew and maybe some other fonts. It's a pretty
decent word-processor and I don't think you even need EGA/VGA
for it, just Hercules.

I used another one which was abbreviated DJ (don't know the full
name of it) and it was horribly buggie.

--
"She's making movies on location she don't know what it means"

Asya Kamsky e-mail: as...@ux5.lbl.gov Go Red Sox
(415)486-7428 other: Lawrence Berkeley Labs, 50/232 Die Yuppie Scum

George Antony

unread,
Apr 4, 1991, 10:12:17 PM4/4/91
to
From article <46...@ut-emx.uucp>, by aj...@ut-emx.uucp (Ajit Dingankar):

Check out this for yourself, as I haven't yet had the opportunity to
do so.

This came from soc.culture.magyar, but appears to be applicable more
widely.


Diplomat Software, Inc.
"Alphabets of the World for your PC"

3700 Campus Drive
Suite 202
Newport Beach, CA 92660

Phone: (714)474-6968, Fax: (714)250-8117

This program apparently "supports all Eastern-European languages and most
leading PC word processors and desktop publishers": sounds like an add-in
program that lets you use your favourite text-basher.

Cheers,

George
-----------------------------------------------------------
George Antony (gan...@gara.une.oz.au)
Department of Agricultural Economics and Business Management
University of New England, Armidale, N.S.W. 2351, Australia
Fax: (+61-67) 711-531 Phone: (+61-67) 733-222
------------------------------------------------------------

Brian Wilford

unread,
Apr 5, 1991, 9:29:31 AM4/5/91
to
In article <46...@ut-emx.uucp> aj...@ut-emx.uucp (Ajit Dingankar) writes:

I have been using a program formerly called Cornell Russian Support (I think itsname has since been modified) available through Exceller Software, Ithica, NY. The program works on top of WordPerfect allowing you to display the Cyrilic alphabet on the screen, print Russian on both dot matrix and laser printers and it offers you two keyboard configurations--the standard soviet config. and a so-called student config. which is preferable for those who are accustomed to typing inEnglish. The program also allo


ws you to switch between English and Russian within the same document via a simple keystroke. Although the program is configuredto work immediately on top of WordPerfect it also can be modified to work on topof other popular word processing programs. The last time I checked I believe
the price was around $100, a separate package of proportional Russian and
English laser fonts is also available for about $100.

Brian Wilford

Dimitri Vulis, CUNY GC Math

unread,
Apr 6, 1991, 12:16:59 AM4/6/91
to
Indeed, such beast exists, and unlike other software advertised on this
newsgroup, it can be had for free. LIST...@UBVM.CC.BUFFALO.EDU has
(in RUSTEX-L FILELIST) two sets of TeX fonts, TeX hyphenation patterns for
Russian, and assorted hacks to display Russian (and some other Cyrillic-based
scripts) on assorted hardware.

Note: the stuff is free, but you have to possess some intelligence to install
it. :)

Dimitri Vulis, D&M
BITNET: DLV@CUNYVMS1
Internet: D...@CUNYVMS1.GC.CUNY.EDU
Snail: Department of Mathematics/Box 330
City University of New York Graduate Center
33 West 42 Street
New York, NY 10036-8099
USA

Calsyn Martin Robert

unread,
Apr 8, 1991, 10:18:05 AM4/8/91
to
In article <1991Apr5.1...@dartvax.dartmouth.edu> al...@eleazar.dartmouth.edu (Alex Hartov) writes:
>I use a Mac with russian fonts. You can use any word processor you like then.

The same should be true of MS-Windows 3.00. I am working up display
fonts for MS-Word for Windows based on ones I use on the Mac. If I have
any luck, I'll post and offer the fonts for anonymous ftp.

>On another topic, is there a standard way to transliteralte written russian
>into english alphabet. Better yet, is there a standard defined for russian
>letters? such as ascii etc?

I seem to remember some back issues of the AATSEEL journal which
contained a coding/collating sequence for Cyrillic. I have used this
sequence in several applications. The proposed collating sequence (is
it a standard yet?) insures that standard sorting and comparison
algorithms result in a proper ordering of Russian words. These same
issues outlined the Soviet and student keyboard layouts. I will attempt
to locate these back issues and post the info here.

Arkasha

--
Martin R. Calsyn mar...@iastate.edu
Project Vincent Systems Support gg....@isumvs.bitnet
Iowa State University Computation Center voice: (515) 294-9889
Ames, Iowa 50011 fax: (515) 294-1717

Dimitri Vulis, CUNY GC Math

unread,
Apr 7, 1991, 11:54:04 PM4/7/91
to
A clirification is in order:

There's no anonymous FTP on UBVM.CC.BUFFALO.EDU. All RusTeX-L-related files
need to be retrieved _by_e-mail_ from LIST...@UBVM.CC.BUFFALO.EDU.

Some of the files have been also mirrored on ACCUVAX.NWU.EDU by Jacob Gore.
One can use anonymous FTP to retrieve them from there. However ACCUVAX
doesn't contain the list archives and non-TeX-related files (such as
the Russian typing tutor). I'd like to move the whole collection to the .SU,
and to maintain the Western counterpart only as a mirror.
Here's a copy of Jacob's message:

From: Jacob Gore <ja...@BLACKBOX.GORE.COM>
Subject: Contents of accuvax ftp archive

Size Date Name Brief description

pub/rustex:

3764 02/01/91 CONTENTS This file
640 11/01/89 README About this archive
dir 11/12/89 documentation Documents and explanations
dir 11/02/89 dos For using cyrillic on MS-DOS
dir 12/15/89 fonts-ifve TeX fonts by Inst. of High Energy Physics, USSR
dir 01/31/91 fonts-postscript PostScript fonts
dir 01/31/91 fonts-uwash TeX fonts by University of Washington
dir 02/14/90 formats Preloaded TeX formats
dir 02/14/90 inputs Files for inclusion into TeX or LaTeX documents
dir 11/02/89 programs Utility programs
dir 11/02/89 vt220 For using cyrillic on DEC VT220 terminals

pub/rustex/documentation:

10449 11/12/89 LetterMappings GOSTCII, EBCDIC CP 500, DKOI, KOI-8
50012 11/02/89 rsam.ter A GOSTCII encoded TeX document, for testing

pub/rustex/dos:

15064 11/02/89 dosgost.arc For using cyrillic on MS-DOS (Dimitri's stuff)

pub/rustex/fonts-ifve:

dir 10/31/89 mf Metafont sources
dir 11/01/89 tfm TeX Font Metric files generated from mf

pub/rustex/fonts-ifve/mf:

404 10/31/89 INDEX
613 10/31/89 README
29475 10/31/89 citall.mf
3700 10/31/89 cmcbx10.mf
3712 10/31/89 cmcbxsl10.mf
3726 10/31/89 cmcbxti10.mf
3718 10/31/89 cmcitt10.mf
3702 10/31/89 cmcsl10.mf
3689 10/31/89 cmcss10.mf
3717 10/31/89 cmcssdc10.mf
3712 10/31/89 cmcti10.mf
3702 10/31/89 cmctt10.mf
3690 10/31/89 cmcyr10.mf
1398 10/31/89 coding.mf
9345 10/31/89 cyrillic.mf
43614 10/31/89 cyrl.mf
44699 12/05/89 cyru.mf
999 10/31/89 cytextit.mf

pub/rustex/fonts-ifve/tfm:

74 11/01/89 README
1468 11/01/89 cmcbx10.tfm
1636 11/01/89 cmcbxsl10.tfm
1040 11/01/89 cmcbxti10.tfm
740 11/01/89 cmcitt10.tfm
1588 11/01/89 cmcsl10.tfm
1492 11/01/89 cmcss10.tfm
1004 11/01/89 cmcti10.tfm
736 11/01/89 cmctt10.tfm
1432 11/01/89 cmcyr10.tfm

pub/rustex/fonts-postscript:

17321 01/16/91 CyrG12.bdf Jay Sekora's font
25487 01/16/91 CyrG24.bdf
34918 01/16/91 CyrillicGothic
7928 01/31/91 CyrillicGothic.README
7783 01/16/91 CyrillicGothic.afm

pub/rustex/fonts-uwash:

dir 12/16/89 mf

pub/rustex/fonts-uwash/mf:

6029 12/15/89 cyrcsc.mf
5959 12/16/89 cyrcsc.mf.old
4816 12/15/89 cyrfont.mf
27952 12/15/89 cyrilu.mf
4198 12/15/89 cyrital.mf
6699 12/15/89 cyrmax.mf
32212 12/15/89 cyrpunc.mf
20903 12/15/89 cyrspl.mf
11766 12/15/89 cyrspu.mf
47741 12/15/89 cyrti.mf
24537 12/15/89 serb.mf
6931 12/15/89 serbspu.mf
5504 12/15/89 wncyb10.mf
5743 12/15/89 wncyb5.mf
5472 12/15/89 wncyb6.mf
5780 12/15/89 wncyb7.mf
5453 12/15/89 wncyb8.mf
5449 12/15/89 wncyb9.mf
3897 12/15/89 wncyi10.mf
3877 12/15/89 wncyi5.mf
3859 12/15/89 wncyi6.mf
3924 12/15/89 wncyi7.mf
3729 12/15/89 wncyi8.mf
3725 12/15/89 wncyi9.mf
5617 12/15/89 wncyr10.mf
5680 12/15/89 wncyr5.mf
5575 12/15/89 wncyr6.mf
5697 12/15/89 wncyr7.mf
5311 12/15/89 wncyr8.mf
5359 12/15/89 wncyr9.mf
5638 12/15/89 wncysc10.mf
5547 12/15/89 wncyss10.mf
5363 12/15/89 wncyss8.mf
5522 12/15/89 wncyss9.mf

pub/rustex/formats:

487104 02/14/90 rlplain.fmt Russian latex format (for pre-3.0 TeX)

pub/rustex/inputs:

27493 11/02/89 rhyphen.tex Dimitri's Russian hyphenation patterns
2010 02/14/90 rlplain.patch Diff from lplain.tex to make rlplain.tex
2174 11/02/89 rmacro.sty Maps 7-bit abbreviations to Russian characters
39987 02/14/90 rusfonts.tex Russian version of lfonts.tex

pub/rustex/programs:

748 11/02/89 gost2ascii.c Converts GOSTSCII to 7-bit abbreviations
6534 11/02/89 trr.pas Same thing, but in Pascal

pub/rustex/vt220:

6723 11/02/89 8X10.PAT
2563 11/02/89 PATTERN.FOR
2258 11/02/89 RUS.REA
288 11/02/89 RUSEDT.COM
15 11/02/89 RUSEDT.EVE
3754 11/02/89 RUSEDT.TPU
158 11/02/89 RUSEDT_I.COM
192 11/02/89 RUSEDT_M.COM
2059 11/02/89 RUS_CHAR
680 11/02/89 RUS_KEYB

Dimitri Vulis, CUNY GC Math

unread,
Apr 8, 1991, 12:07:48 AM4/8/91
to
In article <1991Apr5.1...@dartvax.dartmouth.edu>, al...@eleazar.dartmouth.edu (Alex Hartov) writes:
>On another topic, is there a standard way to transliteralte written russian
>into english alphabet. Better yet, is there a standard defined for russian
>letters? such as ascii etc?
You can get more information by looking thru RusTeX-L's archived messages,
but basically here's an answer:

There is no widely accepted way to unambiguously transliterate Russian
with ASCII letters. (Neither LC nor AMS schemes quite work.) There are more
problems if you're trying to transliterate text which is partly Russian and
partly other Cyrillic (e.g., we recently had to latinize a list of bibliography,
some of whose entries were Ukrainian; we didn't do such a good job).
There is a widely accepted way to map Russian letters into ASCII called KOI-7.

Letter GOST GOST KOI-7 KOI-7 AMS
upper lower upper lower transliteration
a \260 \320 a A A
be \261 \321 b b B
ve \262 \322 w W V
ghe \263 \323 g G G
de \264 \324 d D D
e \265 \325 e E E
yo \360 \361 # $ YO
zhe \266 \326 v V ZH
ze \267 \327 z Z Z
i \270 \330 i I I
short i \271 \331 j J \u{\I}
ka \272 \332 k K K
el \273 \333 l L L
em \274 \334 m M M
en \275 \335 n N N
o \276 \336 o O O
pe \277 \337 p P P
er \300 \340 r R R
es \301 \341 s S S
te \302 \342 t T T
u \303 \343 u U U
ef \304 \344 f F F
ha \305 \345 h H KH
tse \306 \346 c C TS
che \307 \347 ` ^ CH
sha \310 \350 { [ SH
shcha \311 \351 } ] SHCH
hard sign \312 \352 " _ ''
ery \313 \353 y Y Y
soft sign \314 \354 x X '
reverse e \315 \355 | \ \`{E}
yu \316 \356 ` @ YU
ya \317 \357 q Q YA

Note: yes, in KOI-7 lowercase Latin letters represent uppercase Russian letters
and vice versa.

It looks ugly, it's hard to read, but it's so widely used that you'll get
nowhere trying to improve it. :)

As for 8-bit extensions of ASCII, there are 3 common standards:

1. GOSTCII, aka ISO 8859 part 5. I recommend it. :)
2. KOI-8 from the old GOST. It's used internally by RELCOM, and has a nice
property that if you strip the 8th bit, you get KOI-7, which is still readable.
It's used by most Soviet UNIX boxes.
3. "Al'ternativnyj variant", used on most Soviet MS-DOS clones.

Assorted software for "Cyrillic WP" that floats around the US (Duang Jiang,
Diplomat, etc) all use their own 8-bit coding schemes not compatible with each
other. :) :) :)

The current ('86) GOST is given above; if you want the tables for the other
two, dig them up from r-t-l.

Andrew Arensburger

unread,
Apr 8, 1991, 1:22:44 PM4/8/91
to
In article <1991Apr8.1...@news.iastate.edu> mar...@iastate.edu (Calsyn Martin Robert) writes:
>In article <1991Apr5.1...@dartvax.dartmouth.edu> al...@eleazar.dartmouth.edu (Alex Hartov) writes:
>>I use a Mac with russian fonts. You can use any word processor you like then.
>
>The same should be true of MS-Windows 3.00.

I've tried this (on a Mac) and ran into the problem that some of
the letters were mapped to punctuation keys (e.g., the cyrillic 'b' was
the comma key). This led to some... interesting spacing problems. I also
tried to create a russian dictionary for the Word spell-checker, and
couldn't add any words that contained "punctuation" ('b' is the only
example I remember).
So while this may be a good short-term solution, you really need
to be able to remap the keyboard, or somehow explain to the editor that
those funny characters are really letters.
Moral: use either TeX or a specialized multi-language editor.
Avoid editors that are too smart for their own good.

--
-------------------------------------------------------------------\\\\^
Andrew Arensburger | K&R C! | I hate Lisp functions o\\\\\-
are...@cvl.umd.edu | ANSI no! | that start with /
...!uunet!mimsy!cvl!arensb | | "(catch (mapcon (throw" \_/

Calsyn Martin Robert

unread,
Apr 8, 1991, 1:03:54 PM4/8/91
to
I'm an author of text/literature/discourse analysis software which
provides concordances and frequency counts of words, phrases and
concepts as well as output for SAS, SPSS, etc. The product is currently
used for sociological research, law, politics, and marketing. I am
currently expanding this product to provide for multi-language support,
including Slavic languages.

I have some interest in keeping it compatible with the Soviet MS-DOS
clones. Can anyone provide a character code mapping for the display
charset on these clones? Are there technical resources on these clones
available in the U.S.?

Unfortunately the GOST, KOI and AMS sequences use a disjoint,
non-ascending sequence unsuitable for analytic purposes (without mapping
display codings to a separate collating sequence).

Also, my Russian is limited to a few years of college courses and thus
is devoid of technical terms. Any recommendations on a dictionary of
technical terms (particularly computer terms).

This project is not affiliated with Iowa State University -- standard
disclaimers apply.

Dimitri Vulis, CUNY GC Math

unread,
Apr 8, 1991, 3:16:15 PM4/8/91
to
In article <1991Apr8.1...@news.iastate.edu>, mar...@iastate.edu (Calsyn Martin Robert) writes:
>Unfortunately the GOST, KOI and AMS sequences use a disjoint,
>non-ascending sequence unsuitable for analytic purposes (without mapping
>display codings to a separate collating sequence).
1. This is factually incorrect. The new GOST has the contiguous uppercase
Russian alphabet in 0xB0--0xCF, followed by the contigious lowercase
alphabet in 0xD0--0xEF.
2. I am very skeptical of a "text analysis software" that can't handle
text whose collating sequence != arithmetic comparison. This means, it can't
also handle French, German, and other language that use extra letters and
accents, and have special collating sequences.

>
>Also, my Russian is limited to a few years of college courses and thus
>is devoid of technical terms. Any recommendations on a dictionary of
>technical terms (particularly computer terms).
Try Arkady Borkowsky's dictionary.

>
>This project is not affiliated with Iowa State University -- standard
>disclaimers apply.
Ah!!! A friend and colleague of Doug Jones, just as knowledgeable as he is!!! :)

>
>--
>Martin R. Calsyn mar...@iastate.edu
>Project Vincent Systems Support gg....@isumvs.bitnet
>Iowa State University Computation Center voice: (515) 294-9889
>Ames, Iowa 50011 fax: (515) 294-1717

Calsyn Martin Robert

unread,
Apr 9, 1991, 2:07:38 AM4/9/91
to
In article <1991Apr8.1...@timessqr.gc.cuny.edu> d...@cunyvms1.gc.cuny.edu writes:
>1. This is factually incorrect. The new GOST has the contiguous uppercase
>Russian alphabet in 0xB0--0xCF, followed by the contigious lowercase
>alphabet in 0xD0--0xEF.

My apologies for including GOST in the list - my mistake.

>2. I am very skeptical of a "text analysis software" that can't handle
>text whose collating sequence != arithmetic comparison. This means, it can't
>also handle French, German, and other language that use extra letters and
>accents, and have special collating sequences.

Its what I've got to work with (initially never intended to be
multi-lingual). Development versions use front end (keyboard) and
back-end (display) filters. The internal representation must be a
proper collating sequence so that the program can work as rapidly as
possible on LARGE volumes (Gb) of data.

Always open to constuctive dialog! Thanks for the note and the
dictionary reference. And, pardon me, but whatsa 'Doug Jones'?

Dimitri Vulis, CUNY GC Math

unread,
Apr 9, 1991, 10:27:59 AM4/9/91
to
In article <1991Apr8.1...@news.iastate.edu>, mar...@iastate.edu (Calsyn Martin Robert) writes:
>I seem to remember some back issues of the AATSEEL journal which
>contained a coding/collating sequence for Cyrillic. I have used this
>sequence in several applications. The proposed collating sequence (is
>it a standard yet?) insures that standard sorting and comparison
>algorithms result in a proper ordering of Russian words. These same
>issues outlined the Soviet and student keyboard layouts. I will attempt
>to locate these back issues and post the info here.
>
>Arkasha
That's an impossibility. In Russian collating sequence, the soft sign precedes
ya and a few other letters, while in Ukrainian it follows ya. There can be
no universal collating sequence for Cyrillic. Perhaps you should think about
the difference between the words "Russian" and "Cyrillic".

>
>--
>Martin R. Calsyn mar...@iastate.edu
>Project Vincent Systems Support gg....@isumvs.bitnet
>Iowa State University Computation Center voice: (515) 294-9889
>Ames, Iowa 50011 fax: (515) 294-1717

Dimitri Vulis, CUNY GC Math

unread,
Apr 9, 1991, 10:41:21 AM4/9/91
to
In article <1991Apr9.0...@news.iastate.edu>, mar...@iastate.edu (Calsyn Martin Robert) writes:
>In article <1991Apr8.1...@timessqr.gc.cuny.edu> d...@cunyvms1.gc.cuny.edu writes:
>>1. This is factually incorrect. The new GOST has the contiguous uppercase
>>Russian alphabet in 0xB0--0xCF, followed by the contigious lowercase
>>alphabet in 0xD0--0xEF.
>
> My apologies for including GOST in the list - my mistake.
OK. (strictly speaking, YO is out of sequence, but it's never used anyway. :)

>
>>2. I am very skeptical of a "text analysis software" that can't handle
>>text whose collating sequence != arithmetic comparison. This means, it can't
>>also handle French, German, and other language that use extra letters and
>>accents, and have special collating sequences.
>
> Its what I've got to work with (initially never intended to be
> multi-lingual). Development versions use front end (keyboard) and
> back-end (display) filters. The internal representation must be a
> proper collating sequence so that the program can work as rapidly as
> possible on LARGE volumes (Gb) of data.
TeX always applies a translation table to the input and to the output. This
helps weed out invalid characters, and lets it be ASCII inside even if it
appears to be EBCDIC or some such on the outside. I strongly urge you to
pick some of this TeX code for your system. :) :)

>
> Always open to constuctive dialog! Thanks for the note and the
> dictionary reference. And, pardon me, but whatsa 'Doug Jones'?
Sorry, I thought you knew him. :) My aplogies for snapping at you. :) :)
Doug Jones is another Iowan. :) :) :) :) :) :) :)

>
>
>--
>Martin R. Calsyn mar...@iastate.edu
>Project Vincent Systems Support gg....@isumvs.bitnet
>Iowa State University Computation Center voice: (515) 294-9889
>Ames, Iowa 50011 fax: (515) 294-1717
0 new messages