Ancient distros and non-UTF-8

34 views
Skip to first unread message

Tino Didriksen

unread,
Feb 18, 2014, 3:40:16 PM2/18/14
to constrain...@googlegroups.com
I would quite like to clean up the codebase and dependencies by using C++11 and restricting stream and grammar encodings to UTF-8, and to that end I'd like to know if...

- Is anyone using streams or grammars in encodings other than UTF-8?

- Is anyone stuck on ancient versions of GCC g++ older than 4.6?

Debian Stable and Ubuntu 12.04 LTS have 4.6.3, which are fine. Mac OS X users' g++ 4.2 doesn't count - you have XCode's clang++ instead, which is C++11 capable.

I only know of RHEL/CentOS 6.5 that has g++ 4.4.7, but since RHEL is also stuck on a too old version of CMake and nobody has complained about that, I guess RHEL's not really important.

-- Tino Didriksen
CG-3 Developer

Tino Didriksen

unread,
Feb 25, 2014, 3:14:43 AM2/25/14
to constrain...@googlegroups.com
A week on and nobody has objected.

So, nobody minds if I drop support for ISO-8859 family, UTF-16/32, BIG5, etc?

And nobody minds if distros older than Debian stable can't easily build CG-3?


-- Tino Didriksen
CG-3 Developer


Diana Santos

unread,
Feb 25, 2014, 4:15:57 AM2/25/14
to constrain...@googlegroups.com
Hi,
I absolutely mind if you drop support for ISO-8859 family, and I was
also to say that I am unfortunately also stuck with RHEL bacause it is
the Linux distribution that is supported at UiO

At Linguateca we still have a large number of corpora in ISO and we are
using vislcg3 for semantic annotation...
Sorry for this...
Diana
> --
> You received this message because you are subscribed to the Google
> Groups "Constraint Grammar" group.
> To unsubscribe from this group and stop receiving emails from it,
> send an email to constraint-gram...@googlegroups.com.
> To post to this group, send email to
> constrain...@googlegroups.com.
> Visit this group at http://groups.google.com/group/constraint-grammar
> [1].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/constraint-grammar/28e7ea94-0d4f-4e0f-aeb3-566237373aee%40googlegroups.com
> [2].
> For more options, visit https://groups.google.com/groups/opt_out [3].
>
>
> Links:
> ------
> [1] http://groups.google.com/group/constraint-grammar
> [2]
> https://groups.google.com/d/msgid/constraint-grammar/28e7ea94-0d4f-4e0f-aeb3-566237373aee%40googlegroups.com
> [3] https://groups.google.com/groups/opt_out

Tino Didriksen

unread,
Feb 25, 2014, 4:50:24 AM2/25/14
to constrain...@googlegroups.com, d.s.m....@ilos.uio.no
On Tuesday, 25 February 2014 10:15:57 UTC+1, Diana Santos wrote:
I absolutely mind if you drop support for ISO-8859 family, and I was
also to say that I am unfortunately also stuck with RHEL bacause it is
the Linux distribution that is supported at UiO

If you're using RHEL, how are you building the current version of CG-3? For the past 6 months, CG-3 has required at least CMake 2.8.0, but RHEL by default only has CMake 2.6.4.
 
At Linguateca we still have a large number of corpora in ISO and we are
using vislcg3 for semantic annotation...

Having data in ISO-8859 family is not really a problem, since you can just pipe it through "uconv -f ISO-8859-1 -t UTF-8" (or iconv, or recode).

Which is one major reason I want it out of CG-3 itself - there are better tools for stream encoding conversions (uconv, iconv, recode), so there is no good reason for CG-3 to do that job.

-- Tino Didriksen

Paul Meurer

unread,
Feb 25, 2014, 5:06:05 AM2/25/14
to constrain...@googlegroups.com
Hi,

sorry, I didn't see this anouncement before now.

We are developing on CentOS 6.5, so support for it would be important for me until CentOS 7 is out.

--
You received this message because you are subscribed to the Google Groups "Constraint Grammar" group.
To unsubscribe from this group and stop receiving emails from it, send an email to constraint-gram...@googlegroups.com.
To post to this group, send email to constrain...@googlegroups.com.

- Paul

-- 
Paul Meurer, researcher
Uni Computing, Uni Research AS
Høyteknologisenteret
Thormøhlensgate 55
N-5008 Bergen
Phone +47 55 58 97
http://www.computing.uni.no/units/clu





Trosterud Trond

unread,
Feb 25, 2014, 6:07:20 AM2/25/14
to constrain...@googlegroups.com
Tino Didriksen <Tino.Di...@gmail.com> kirjoitti 25. feb. 2014 kello 09:14:

So, nobody minds if I drop support for ISO-8859 family, UTF-16/32, BIG5, etc?
Me? No. UTF-8 is it.

And nobody minds if distros older than Debian stable can't easily build CG-3?
Me? No.

Trond.

Lars Nygaard

unread,
Feb 25, 2014, 6:26:00 AM2/25/14
to constrain...@googlegroups.com
Could people stuck on older distros use binaries compiled on newer
systems? Would this be acceptable to people, as long as new versions
are promptly publised in binary form?

-lars
> --
> You received this message because you are subscribed to the Google Groups
> "Constraint Grammar" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to constraint-gram...@googlegroups.com.
> To post to this group, send email to constrain...@googlegroups.com.
> Visit this group at http://groups.google.com/group/constraint-grammar.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/constraint-grammar/B7EF2254-AFB6-45E1-931E-AEE8F4DB15E9%40gmail.com.
Reply all
Reply to author
Forward
0 new messages