
How much memory without using malloc?


Peabody

May 16, 2018, 2:06:34 PM
I have extremely limited experience in C. In the few programs I've written,
I've simply declared variables and arrays without requesting any additional
memory. But I think the largest was a 2048-character array. Now I'm looking
at more like 500,000 characters. At what point do you have to use something
like malloc to get enough memory? There must be some limit as to what
Windows will let you use without a malloc.

If it matters, this is all Windows console, 32-bit, LCC compiler.



bartc

May 16, 2018, 2:33:23 PM
Do you mean lccwin32?

I tried this program:

#include <stdio.h>

char memory[1500000000];

int main(void) {
printf("%zu\n", sizeof(memory));
}

and it worked using 1.5GB of static memory without malloc. But 2.0GB (or
2.0GiB, whichever one is 2e9 bytes), crashed.

But the data must be declared outside of a function. Inside a function,
stack memory is very limited unless you declare data as static.

Of course, the machine needs to have that spare RAM available, and it's
the same whether you use malloc, or this static allocation.

--
bartc

Ben Bacarisse

May 16, 2018, 3:16:58 PM
Peabody <waybackNO...@yahoo.com> writes:

> I have extremely limited experience in C. In the few programs I've written,
> I've simply declared variables and arrays without requesting any additional
> memory. But I think the largest was a 2048-character array. Now I'm looking
> at more like 500,000 characters. At what point do you have to use something
> like malloc to get enough memory? There must be some limit as to what
> Windows will let you use without a malloc.

Try it and see! That's the simplest way.

Using malloc is not hard. The only trouble is that you should check the
result, but for programs that can get all space they need at the start
that's simple.

You will usually find that the most limited space is stack space. That's
for declared objects with what C calls automatic storage duration --
basically things declared in a function (even main). But even there, I
doubt the 500,000 bytes will cause a problem on a modern machine.

--
Ben.

Jorgen Grahn

May 16, 2018, 3:58:41 PM
On Wed, 2018-05-16, Peabody wrote:
> I have extremely limited experience in C. In the few programs I've
> written, I've simply declared variables and arrays without
> requesting any additional memory. But I think the largest was a
> 2048-character array. Now I'm looking at more like 500,000
> characters. At what point do you have to use something like malloc
> to get enough memory?

It's not about the actual limit, but about whether there /is/ a known
limit. How do you know, in this case, that you need 500k, and never
ever 600k?

/Jorgen

--
// Jorgen Grahn <grahn@ Oo o. . .
\X/ snipabacken.se> O o .

Peabody

May 16, 2018, 5:10:50 PM
bartc says...

>> If it matters, this is all Windows console, 32-bit, LCC
>> compiler.

> Do you mean lccwin32?

Yes.

>I tried this program:
>
> #include <stdio.h>
>
> char memory[1500000000];
>
> int main(void) {
> printf("%zu\n", sizeof(memory));

> and it worked using 1.5GB of static memory without
> malloc. But 2.0GB (or 2.0GiB, whichever one is 2e9
> bytes), crashed.

Ok, well I was thinking a program would be limited to a lot
less than that. So it looks like I will be ok.

> But the data must be declared outside of a function.
> Inside a function, stack memory is very limited unless
> you declare data as static.

I think last time I declared it in Main. But as I said
before, it was only 2K. By the way, does it need to be
"unsigned char" or is "char" automatically unsigned. Seems
like it would be. Or does it depend on the compiler?

Thanks for your help.

Peabody

May 16, 2018, 5:16:29 PM
Ben Bacarisse says...

> Using malloc is not hard. The only trouble is that you
> should check the result, but for programs that can get
> all space they need at the start that's simple.

So you say, but you're fluent in C. For the rest of
humanity, I suspect malloc suffers from the same issue that
plagues all normal people.

Of course I'm speaking of pointers.

bartc

May 16, 2018, 5:25:10 PM
On 16/05/2018 22:10, Peabody wrote:

> By the way, does it need to be
> "unsigned char" or is "char" automatically unsigned. Seems
> like it would be. Or does it depend on the compiler?

On Windows, most compilers make 'char' signed (out of 7 compilers I
have, only one makes 'char' unsigned). Some may have an option to change
it (not lccwin32).

But it depends on what you want to do with it. If you specifically need
it unsigned or signed, then declare unsigned char or signed char. If it
doesn't matter, just use 'char'.

If it needs to be unsigned but 'unsigned char' is too long-winded, then
try this:

typedef unsigned char byte;

byte memory[500000000];

Just use 'byte' (or whatever short name you choose) in place of
'unsigned char'.

--
bartc

Peabody

May 16, 2018, 5:34:56 PM
Jorgen Grahn says...

> It's not about the actual limit, but about whether there
> /is/ a known limit. How do you know, in this case, that
> you need 500k, and never ever 600k?

In fact, I could need as much as 2048K. That's the maximum
firmware size of some models of the STM32F microcontrollers.

I'm trying to "help" on a project involving an electronics
device which uses an F303CC, and the problem is providing
firmware updates in encrypted form. As of now, the device
has a 12MB serial flash memory attached, and behaves as a
removable drive when it's plugged into Windows USB. So to
update, you just transfer the new file to the "drive", then
reboot into the custom bootloader and select that file to
flash.

But that file is now Intel hex, and I don't think it's a
good idea to encrypt a file with many repetitions of
"[CRLF]:10" at predictable intervals - seems like it might
make it easier to decrypt. So I was thinking of converting
the .hex file to raw .bin and then encrypting that. But
that leaves me looking for a format for the .bin file.
There would still need to be multiple segments, and each
would need a segment type, starting address, and length.
It's the length that's the problem because I'll have to go
back and fill that field only after reaching the end of it
in the .hex file. So that means keeping all of the binary
image in memory until it's finalized, and then encrypting
that.

Well, that probably doesn't make any sense. But it seems I
may also have an endianness issue with the length field.

Maybe encrypting a .hex file wouldn't be so bad after all.

Keith Thompson

May 16, 2018, 5:45:41 PM
Peabody <waybackNO...@yahoo.com> writes:
[...]
> I think last time I declared it in Main.

"main", not "Main". C is case-sensitive. It's a good idea to get into
the habit of being precise.

> But as I said
> before, it was only 2K. By the way, does it need to be
> "unsigned char" or is "char" automatically unsigned. Seems
> like it would be. Or does it depend on the compiler?

It depends on the compiler.

"char", "signed char", and "unsigned char" are all distinct types.
Plain "char" has the same size and representation as either "signed
char" or "unsigned char"; the choice is implementation-defined.
Don't write code that assumes one or the other.

Use "char" when you want to store character data. Use "signed char"
when you want a very small integer type (not a common need). Use
"unsigned char" when you want to store raw data, usually in arrays.

--
Keith Thompson (The_Other_Keith) ks...@mib.org <http://www.ghoti.net/~kst>
Working, but not speaking, for JetHead Development, Inc.
"We must do something. This is something. Therefore, we must do this."
-- Antony Jay and Jonathan Lynn, "Yes Minister"

Keith Thompson

May 16, 2018, 7:16:02 PM
bartc <b...@freeuk.com> writes:
[...]
> If it needs to be unsigned but 'unsigned char' is too long-winded, then
> try this:
>
> typedef unsigned char byte;
>
> byte memory[500000000];
>
> Just use 'byte' (or whatever short name you choose) in place of
> 'unsigned char'.

"unsigned char" is not too long-winded. Any C programmer will know
exactly what it means. Any C programmer seeing "byte" will guess
that it's *probably* unsigned char, but will have to track down
the declaration to be sure.

Keith Thompson

May 16, 2018, 7:17:41 PM
Peabody <waybackNO...@yahoo.com> writes:
[...]
> But that file is now Intel hex, and I don't think it's a
> good idea to encrypt a file with many repetitions of
> "[CRLF]:10" at predictable intervals - seems like it might
> make it easier to decrypt.
[...]

Not really a topic for this newsgroup, but any decent modern
encryption algorithm shouldn't have this kind of problem.

bartc

May 16, 2018, 7:45:39 PM
On 17/05/2018 00:15, Keith Thompson wrote:
> bartc <b...@freeuk.com> writes:
> [...]
>> If it needs to be unsigned but 'unsigned char' is too long-winded, then
>> try this:
>>
>> typedef unsigned char byte;
>>
>> byte memory[500000000];
>>
>> Just use 'byte' (or whatever short name you choose) in place of
>> 'unsigned char'.
>
> "unsigned char" is not too long-winded.

That's your opinion. Mine is that:

unsigned char* Strfn(const unsigned char* s, const unsigned char* t)
{
unsigned char* r;
....
}

(with a few consts thrown in, as people like using them)
looks a lot more long-winded and a nuisance to type than:

char* Strfn(char* s, char* t) {
char* r;
....
}

I think many will agree with me otherwise stdint.h would have defined
unsigned_int32_t rather than uint32_t.

> Any C programmer will know
> exactly what it means. Any C programmer seeing "byte" will guess
> that it's *probably* unsigned char, but will have to track down
> the declaration to be sure.

They will have to anyway when people use typedefs, and they tend to do
that a lot. Tiny C uses 300 of them. Sqlite about 2000. While windows.h
seems to consist of little else. Plus all the macros you also need to
look up to find out what they do.

Really, using a 'byte' typedef would be the least of anyone's problems
when trying to read other people's code.

But by all means continue writing 'const unsigned long long int' in your
own code if you think it is better.

--
bartc

Ben Bacarisse

May 16, 2018, 7:54:45 PM
As a first approximation, you replace:

int main(void)
{
char array[1000];
/* code that uses array[0] to array [999] */
}

with

#include <stdlib.h>

int main(void)
{
char *array = malloc(1000 * sizeof *data);
if (array != NULL) {
/* code that uses array[0] to array [999] */
}
}

The magic "1000 * sizeof *data" is the way to ask for room for 1000 of
whatever data points to (in this case characters). The reason this is a
good idea is that it works for any type. If you need an array of a
million ints you write

int *many = malloc(1000000 * sizeof *many);

instead.

I'm saying this only because learning new stuff is usually a good thing.
My original post suggested that you probably won't need to do this for a
0.5MB array, particularly if you declare the array outside of main (or
inside with the static keyword).

--
Ben.

Keith Thompson

May 16, 2018, 8:09:52 PM
Ben Bacarisse <ben.u...@bsb.me.uk> writes:
[...]
> #include <stdlib.h>
>
> int main(void)
> {
> char *array = malloc(1000 * sizeof *data);

You meant
char *array = malloc(1000 * sizeof *array);
or
char *data = malloc(1000 * sizeof *data);

(Naming a pointer object "array" could be confusing.)

[...]

Ian Collins

May 16, 2018, 9:27:37 PM
On 17/05/18 11:45, bartc wrote:
> On 17/05/2018 00:15, Keith Thompson wrote:
>> bartc <b...@freeuk.com> writes:
>> [...]
>>> If it needs to be unsigned but 'unsigned char' is too long-winded, then
>>> try this:
>>>
>>> typedef unsigned char byte;
>>>
>>> byte memory[500000000];
>>>
>>> Just use 'byte' (or whatever short name you choose) in place of
>>> 'unsigned char).
>>
>> "unsigned char" is not too long-winded.
>
> That's your opinion. Mine is that:
>
> unsigned char* Strfn(const unsigned char* s, const unsigned char* t)

Which is why we use uint8_t.

--
Ian

Peabody

May 16, 2018, 11:51:55 PM
Keith Thompson says...

>> But that file is now Intel hex, and I don't think it's
>> a good idea to encrypt a file with many repetitions of
>> "[CRLF]:10" at predictable intervals - seems like it
>> might make it easier to decrypt. [...]

> Not really a topic for this newsgroup, but any decent
> modern encryption algorithm shouldn't have this kind of
> problem.

I was thinking RC4, which is not exactly modern, but still
widely used despite its shortcomings. Well since hex file
parsing is already implemented in the custom bootloader,
I'll just stick with encrypting the hex file. However, I
just got home from my local open source hardware meetup, and
the guys there say there's little point in bothering with
encryption. For chips like the STM32, cloners just decap
the chip and read out the current contents.


already...@yahoo.com

May 17, 2018, 4:55:17 AM
On Thursday, May 17, 2018 at 12:45:41 AM UTC+3, Keith Thompson wrote:
>
> Use "signed char" when you want a very small integer type (not a common need).
>

IMHO, using "signed char" for small integers makes your intentions less clear than int8_t. int_fast8_t probably expresses intentions even better, but it looks ugly.
I regard "signed char" as a legacy type whose only purpose by now should be interfacing with old code and libraries.

Ben Bacarisse

May 17, 2018, 7:28:41 AM
Keith Thompson <ks...@mib.org> writes:

> Ben Bacarisse <ben.u...@bsb.me.uk> writes:
> [...]
>> #include <stdlib.h>
>>
>> int main(void)
>> {
>> char *array = malloc(1000 * sizeof *data);
>
> You meant
> char *array = malloc(1000 * sizeof *array);
> or
> char *data = malloc(1000 * sizeof *data);

Yes, thanks.

--
Ben.

Scott Lurndal

May 17, 2018, 9:11:25 AM
Peabody <waybackNO...@yahoo.com> writes:
>Jorgen Grahn says...
>
> > It's not about the actual limit, but about whether there
> > /is/ a known limit. How do you know, in this case, that
> > you need 500k, and never ever 600k?
>
>In fact, I could need as much as 2048K. That's the maximum
>firmware size of some models of the STM32F microcontrollers.
>
>I'm trying to "help" on a project involving an electronics
>device which uses an F303CC, and the problem is providing
>firmware updates in encrypted form. As of now, the device
>has a 12MB serial flash memory attached, and behaves as a
>removable drive when it's plugged into Windows USB. So to
>update, you just transfer the new file to the "drive", then
>reboot into the custom bootloader and select that file to
>flash.
>
>But that file is now Intel hex, and I don't think it's a
>good idea to encrypt a file with many repetitions of
>"[CRLF]:10" at predictable intervals - seems like it might
>make it easier to decrypt.

If you use a good modern encryption algorithm, that
won't be a problem.

Mr. Man-wai Chang

May 17, 2018, 9:48:45 AM
You need operating system API calls to really monitor memory usage...
it's not really just a C issue. C programs run inside an operating system.

--
@~@ Remain silent! Drink, Blink, Stretch! Live long and prosper!!
/ v \ Simplicity is Beauty!
/( _ )\ May the Force and farces be with you!
^ ^ (x86_64 Ubuntu 9.10) Linux 2.6.39.3
No borrowing! No fraud! No gambling! No compensated dating! No fighting! No robbery! No suicide! No praying to gods! Please consider Comprehensive Social Security Assistance
(CSSA):
http://www.swd.gov.hk/tc/index/site_pubsvc/page_socsecu/sub_addressesa

Malcolm McLean

May 17, 2018, 11:14:19 AM
The stack (where local variables and arrays go) is designed for holding
small amounts of data. For anything over about 1K, consider whether it
is too large for the stack.
The alternatives are malloc() and free(), or declaring a large static pool
of memory. The advantage of malloc() is that it is scalable to the
maximum amount of memory in your system, and you can free the memory
for reuse when you are done with it. The disadvantages are that it
can return null, and some implementations "soft crash" by paging out
large allocations to disk - usually this isn't a viable solution
as it then takes too long to execute.
A static buffer you are guaranteed to have, but you've got to specify
a size. It's not so friendly to other programs if you choose a very
large size to handle a rare exceptional case - whether that matters
or not just depends. It may be that the computer is primarily running
your program and isn't a general-purpose machine for web browsing and
clerical work. It then doesn't matter if other programs have constrained
resources. However it may be that your program is just one amongst
many, so excessive resource usage will make it unpopular.

If the static buffer exceeds some memory limit, the program won't load
at all. Again, that might be what you want, or it might be a disaster.

David Brown

May 17, 2018, 2:24:04 PM
I agree, except when you need greater portability. "signed char" is
always the smallest signed integer type supported by the platform. It
is the same (baring seriously weird theoretical implementations) as
"int_least8_t". "int_fast8_t" may well be bigger than a "signed char" -
on some platforms it will be as big as 32 bits.

In many situations where you need a very small integer type (as Keith
says, these are not that common), then very often int8_t expresses your
needs accurately - typically you /do/ want to be sure it is 8-bit to fit
in with file formats, network structures, etc.

David Brown

May 17, 2018, 2:34:13 PM
On 16/05/18 23:34, Peabody wrote:
> Jorgen Grahn says...
>
> > It's not about the actual limit, but about whether there
> > /is/ a known limit. How do you know, in this case, that
> > you need 500k, and never ever 600k?
>
> In fact, I could need as much as 2048K. That's the maximum
> firmware size of some models of the STM32F microcontrollers.
>

Just to be clear here, are you writing the C code for the
microcontroller or for the PC?

If you are writing for a microcontroller, you should /strongly/ prefer
static allocation (either at file scope, or as a "static" within a
function) because it gives you a much clearer picture of the memory
requirements of your project. Your linker will catch errors, and your
map file will show detailed memory information.

(If you want to know more here, just ask - but there is no point in my
giving details if it is not helpful.)

> I'm trying to "help" on a project involving an electronics
> device which uses an F303CC, and the problem is providing
> firmware updates in encrypted form. As of now, the device
> has a 12MB serial flash memory attached, and behaves as a
> removable drive when it's plugged into Windows USB. So to
> update, you just transfer the new file to the "drive", then
> reboot into the custom bootloader and select that file to
> flash.
>
> But that file is now Intel hex, and I don't think it's a
> good idea to encrypt a file with many repetitions of
> "[CRLF]:10" at predictable intervals - seems like it might
> make it easier to decrypt. So I was thinking of converting
> the .hex file to raw .bin and then encrypting that. But
> that leaves me looking for a format for the .bin file.

Intel hex format files have much lower entropy than a raw binary file
would have. Yes, that /can/ make it easier to decrypt - but if you pick
a decent algorithm there will be no problem. As long as it is cheaper
and easier for the bad guys to go around to your house and threaten you
into telling them the encryption key, your encryption algorithm is
strong enough!

However, it is a lot more efficient to use a binary format - it will be
about a third of the size of the Intel Hex format.

Raw binary images for microcontroller firmware don't usually have much
of a format as such - they are just memory images. You have the
interrupt vectors and other device-specific features at certain fixed
addresses in the image - the rest is just the program code. But it can
be worth adding a few extra bits and pieces in a structure at a fixed
spot - typically I include the length of the image, a CRC check, a
marker or identifier for the program type or card type (so that your
update system doesn't put a program for a radio into your CD player, or
vice versa), and perhaps a copyright string and version information.

> There would still need to be multiple segments, and each
> would need a segment type, starting address, and length.

Nope. You generally don't need that, unless the program is split into
many parts in memory. In a microcontroller like an STM32F, all the
flash is contiguous. (You don't need any ram segments in the image.)

> It's the length that's the problem because I'll have to go
> back and fill that field only after reaching the end of it
> in the .hex file. So that means keeping all of the binary
> image in memory until it's finalized, and then encrypting
> that.
>
> Well, that probably doesn't make any sense. But it seems I
> may also have an endianness issue with the length field.

Go little-endian all the way.

David Brown

May 17, 2018, 2:36:54 PM
On 17/05/18 05:51, Peabody wrote:
> Keith Thompson says...
>
> >> But that file is now Intel hex, and I don't think it's
> >> a good idea to encrypt a file with many repetitions of
> >> "[CRLF]:10" at predictable intervals - seems like it
> >> might make it easier to decrypt. [...]
>
> > Not really a topic for this newsgroup, but any decent
> > modern encryption algorithm shouldn't have this kind of
> > problem.
>
> I was thinking RC4, which is not exactly modern, but still
> widely used despite its shortcomings.

DES or 3DES is widely implemented on embedded systems, and very common.
It is sufficient for most uses.

> Well since hex file
> parsing is already implemented in the custom bootloader,
> I'll just stick with encryping the hex file. However, I
> just got home from my local open source hardware meetup, and
> the guys there say there's little point in bothering with
> encryption. For chips like the STM32, cloners just decap
> the chip and read out the current contents.
>

They are right. It is seldom worth the bother of encrypting the code.
Give some thought as to who might want access to the binary image, and
what they might be able to do with it. If the answer is "not many" and
"not much", then don't bother complicating matters.

Peabody

May 17, 2018, 7:36:44 PM
David Brown says...

> Just to be clear here, are you writing the C code for
> the microcontroller or for the PC?

I'm doing a demonstration program for the PC - both an
encryption program and a decryption program. Just to make
sure I understand the algorithm properly.

>> There would still need to be multiple segments, and
>> each would need a segment type, starting address, and
>> length.

> Nope. You generally don't need that, unless the program
> is split into many parts in memory. In a
> microcontroller like an STM32F, all the flash is
> contiguous.

Well, the processor in question has 256K of flash, and the
Intel Hex format only allows 64K using the normal "00"
record type lines (two bytes for the address). A sample hex
file I have for that chip has record type "04" lines every
64K, which establish the new upper word of the address for
up to the following 64K. In addition, I think it should be
possible to have the code not be contiguous. And there's
also a record type "05" line that I think is the address to
jump to when flashing is completed.

So for all these reasons, I think even the "raw" version of
the file should have record type, address, and length fields
at the beginning of each contiguous block of code. So the
structure of the raw file would look like the hex file
except the record type 00 data wouldn't be divided into
lines, and of course everything would be in binary instead
of ascii hex representation.

As I said before, the length field complicates things
because I won't know what to put in there until I've come to
the end of the block. Hence the need to keep it all, or at
least each block, in memory. But I guess in theory 64K is
the largest block I would really need if I write each block
to disk once it's encrypted.

I may go ahead and finish this, but since the custom
bootloader for this project can already parse Intel Hex
files, I think it's just easier to encrypt the hex file if
that's not insecure, although, as you suggest, the raw
version is more efficient, and, well, more elegant.

David Brown

May 18, 2018, 4:28:48 AM
On 18/05/18 01:36, Peabody wrote:
> David Brown says...
>
> > Just to be clear here, are you writing the C code for
> > the microcontroller or for the PC?
>
> I'm doing a demonstration program for the PC - both an
> encryption program and a decryption program. Just to make
> sure I understand the algorithm properly.
>
> >> There would still need to be multiple segments, and
> >> each would need a segment type, starting address, and
> >> length.
>
> > Nope. You generally don't need that, unless the program
> > is split into many parts in memory. In a
> > microcontroller like an STM32F, all the flash is
> > contiguous.
>
> Well, the processor in question has 256K of flash, and the
> Intel Hex format only allows 64K using the normal "00"
> record type lines (two bytes for the address). A sample hex
> file I have for that chip has record type "04" lines every
> 64K, which establish the new upper word of the address for
> up to the following 64K. In addition, I think it should be
> possible to have the code not be contiguous. And there's
> also a record type "05" line that I think is the address to
> jump to when flashing is completed.

It is certainly common for hex files to have multiple sections. The 64K
limit in the Intel Hex format is one reason; others include different
output sections from the linker (for vectors, code, read-only data,
etc.). But the flash memory on the device is contiguous and can be
treated as a single block. When converting the hex file to a binary
file, fill in the gaps with 0xff (for marginally faster programming) or
0x00 (it feels more natural if you are looking at the binary). The 05
record for program start address is usually completely ignored - the
microcontroller uses the reset vector address for program startup.

You might want to talk to the folk writing the code on the
microcontroller about this kind of thing. It can be covered quickly
person to person (preferably with a whiteboard at hand).

>
> So for all these reasons, I think even the "raw" version of
> the file should have record type, address, and length fields
> at the beginning of each contiguous block of code. So the
> structure of the raw file would look like the hex file
> except the record type 00 data wouldn't be divided into
> lines, and of course everything would be in binary instead
> of ascii hex representation.
>
> As I said before, the length field complicates things
> because I won't know what to put in there until I've come to
> the end of the block. Hence the need to keep it all, or at
> least each block, in memory. But I guess in theory 64K is
> the largest block I would really need if I write each block
> to disk once it's encrypted.

Put space for the program record (program type, length, maybe other
things as I mentioned in another post) at a fixed address near the
beginning of the binary - this must be included in the microcontroller
program side of the system. Your PC code reads in the hex file (or bin
file) output from the microcontroller development tools, then fills in
the blank record. Maybe also add a 32-bit CRC at the end of the image.

Alternatively, if you don't want to make changes to the microcontroller
side of the coding, then your program record is only ever sent to the
bootloader/updater on the device, and never programmed into the flash.

Do not encrypt your program record - that leads to insanity.
(Preferably, don't encrypt anything at all - you don't want that extra
complication when you are just learning this stuff.)


And don't write the PC program in C, unless you are a very experienced C
programmer and know no other programming languages. C (on a PC) has no
problem with large lumps of data, statically or dynamically allocated.
But it is a hopeless language for something like parsing Intel Hex files
or other string manipulation. You'll spend far more time trying to
figure out your buffers, memory allocations, data structures, etc., than
actually solving the problem. Use a language that has direct support
for strings, lists, pattern matching, etc. It will be a fraction of the
development effort.

Usually you shouldn't even bother with hex file parsing - just use
objcopy, which will be part of the development tools for the
microcontroller. It will convert your file to a nice raw binary image.

Steve Carroll

unread,
May 18, 2018, 11:12:58 AM5/18/18
to
Just some names Steven Petruzzellis has used
"Evil" John *
"Evil" Snit *
Big Crotch on a Small Fish
Cornelius Munshower
CSMA Moderator
Edward Stanfield
Fretwiz *
Hitman Hero
Measles
Petruzzellis Kids
Sigmond
Slaveen
Smit
Steve C *
Steve Camoll *
Steve Carroll <noone@xxxxxxxxxxx> *
Steve Carroll <stevecarroll@xxxxxxxxxxx> *
Steve Carroll <trollkiller@xxxxxx> *
Steve Carroll's Dog *
Steve Carrolll *
Steve Carrroll *
Yevette Owens
Yobo_Obyo


So, yeah, I buy into my own fiction, fully knowing it would never be discovered by others, because it makes me reconsider my program, improving it. Larry Washington's posts arein fact all unfair. There's zero dispute that as soon as any released 'plonkee' does one thing to wound the poor crybaby's feelings that they'll be blocked again.

My view is much more sophisticated.

Larry Washington doesn't have any idea what he is sniveling about. Sandman makes things up. Here is a list of names Jonas Eklundh has admitted he attributes to Larry Washington "Cactus Pete", "Donald", "Donald Miller", "Horace McSwain", "Hymen", "meat", "Mike Weaver", "Modena IV Drid", "Omar Murad Asfour", "Rhino Plastee", "Soapy", "SopwithCamel", "Sunny Day", "Takuya Saitoh", "The Letter Q", "tmelmosfire", "zevon".

--
Do not click this link!
https://redd.it/6sfhq6
Jonas Eklundh

Nick Bowler

May 18, 2018, 3:59:59 PM
On Thu, 17 May 2018 00:54:37 +0100, Ben Bacarisse wrote:
> The magic "1000 * sizeof *data" is the way to ask for room for 1000 of
> whatever data points to (in this case characters). The reason this is a
> good idea is that it works for any type. If you need an array of a
> million ints you write
>
> int *many = malloc(1000000 * sizeof *many);
>
> instead.

Well, except this will silently do the wrong thing if the multiplication
result doesn't fit in a size_t. The above will work for a sufficiently
large size_t and a sufficiently small int but it certainly doesn't work
in general for "any type".

Moreover depending on the type of 1000000 and the rank of size_t the
multiplication might have a signed result, and without any checks that
could potentially overflow (=> undefined behaviour).

So in the general case one usually needs to write something like:

  if (n > SIZE_MAX / sizeof *many) {
      /* result won't fit in size_t, fail */
  }

  many = malloc(n * sizeof *many);
  if (!many) {
      /* malloc error, fail */
  }

Alternately one can use calloc, which doesn't require this multiplication.
But in the real world many implementations exist which have the same bug
in their calloc implementation, so your mileage may vary...

supe...@casperkitty.com

unread,
May 18, 2018, 4:55:03 PM5/18/18
to
On Friday, May 18, 2018 at 2:59:59 PM UTC-5, Nick Bowler wrote:
> Alternately one can use calloc, which doesn't require this multiplication.
> But in the real world many implementations exist which have the same bug
> in their calloc implementation, so your mileage may vary...

There are also many real-world implementations where malloc() or calloc()
might allocate address space without immediately committing physical memory.
This may improve physical memory utilization in cases where programs use
malloc() to acquire a huge block of memory but only use a tiny part of it,
but may cause programs to arbitrarily fail the first time they access a
particular piece of an allocated region if the amount of available RAM has
fallen since it was allocated.

IMHO, it would be helpful to categorize implementations based upon what they
promise with regard to the behavior of malloc() and related functions. The
idea that functions should return null when out of memory is sometimes useful
on platforms that can uphold such a guarantee, but treating it as requirement
would make it impossible to implement C on some platforms.

Peabody

unread,
May 18, 2018, 5:33:13 PM5/18/18
to
David Brown says...

> And don't write the PC program in C, unless you are a very
> experienced C programmer and know no other programming
> languages. C (on a PC) has no problem with large lumps
> of data, statically or dynamically allocated. But it is
> a hopeless language for something like parsing Intel Hex
> files or other string manipulation. You'll spend far
> more time trying to figure out your buffers, memory
> allocations, data structures, etc., than actually
> solving the problem. Use a language that has direct
> support for strings, lists, pattern matching, etc. It
> will be a fraction of the development effort.

Too late. Well, I'm obviously not experienced in C. But
all my programming experience in the past has been with
assembler - for 6502, for x86(DOS), and MSP430. That's
if you don't count Fortran and Basic. But last year I
needed to modify the source of a TI program called BSLDEMO,
which is the Windows side of some MSP430 parts' BSL setup.
So I had to learn enough about C to do that and recompile.

Then I had to write my own program to do the BSL thing for
certain other MSP430 parts which use a custom bootloader. I
could have written it in VBScript, except that VBScript
doesn't do COM ports. So really the only option I had was
C. And I had some very helpful ANs from Silabs about
talking to COM ports in C that made it possible.

It's not pretty code, but I managed to parse not only Intel
Hex but also TI-TEXT files, then send the binary to the MCU.
But those were tiny files, unlike the ones for STM32F's
here.

But I have to agree that the logic of the task at hand was
not the big stumbling block. It was the C notation and
jargon. And it's all so terse that it's not clear what's
going on.

Where is Pascal when we really need it? :-)

supe...@casperkitty.com

unread,
May 18, 2018, 6:10:56 PM5/18/18
to
On Friday, May 18, 2018 at 4:33:13 PM UTC-5, Peabody wrote:
> Where is Pascal when we really need it? :-)

It got displaced by C for a couple of reasons:

1. While many Pascal implementations offered enough features to make
the language usable for practical purposes, they did so inconsistently,
because nothing in the design of the language suggested how they should
work. The design of the C language, by contrast, often offered some
strong clues.

For example, in Turbo Pascal, the syntax to write 0x1A01 to the 16 bits
at segment 0xB800 offset 160 is "MemW[$B800:160] := $1A01;", but there
is nothing in Pascal itself that would suggest the identifier MemW, nor
the use of two numbers separated by a colon, as a means of performing
such an access. In an 8088 implementation of C configured to use 32-bit
pointers by default, the syntax is "*((unsigned*)0xB80000A0) = 0x1A01;".
The design and history of the language suggested that conversion
from an integer type to a same-size pointer should be representation-
preserving, and that 8088 implementations should use 16-bit "int".

2. There existed a document that looked like a standard for C, even though
it made no attempt to specify everything necessary to make something be
a *good* implementation which is suitable for any particular purpose.

3. Simple compilers could more easily yield decent performance when given
code which used pointer-marching techniques than when given code that
used array indexing.

Those factors pretty much killed off Pascal even though it had a number of
features that good languages should have but C doesn't, such as a proper
pass-by-reference facility.

Melzzzzz

unread,
May 18, 2018, 6:12:56 PM5/18/18
to
On 2018-05-18, supe...@casperkitty.com <supe...@casperkitty.com> wrote:
> Those factors pretty much killed off Pascal even though it had a number of
> features that good languages should have but C doesn't, such as a proper
> pass-by-reference facility.

C'mon, pass by reference ;)
C in its spirit does not need that.



--
press any key to continue or any other to quit...

bartc

unread,
May 18, 2018, 6:22:19 PM5/18/18
to
On 18/05/2018 22:33, Peabody wrote:
> David Brown says...
>
> > And don't write the PC program in C, unless you are a very
> > experienced C programmer and know no other programming
> > languages. C (on a PC) has no problem with large lumps
> > of data, statically or dynamically allocated. But it is
> > a hopeless language for something like parsing Intel Hex
> > files or other string manipulation. You'll spend far
> > more time trying to figure out your buffers, memory
> > allocations, data structures, etc., than actually
> > solving the problem. Use a language that has direct
> > support for strings, lists, pattern matching, etc. It
> > will be a fraction of the development effort.
>
> Too late. Well, I'm obviously not experienced in C. But
> all my programming experience in the past has been with
> assembler - for 6502, for x86(DOS), and MSP430. That's
> if you don't count Fortran and Basic. But last year I
> needed to modify the source of a TI program called BSLDEMO,
> which is the Windows side of some MSP430 parts' BSL setup.
> So I had to learn enough about C to do that and recompile.
>
> Then I had to write my own program to do the BSL thing for
> certain other MSP430 parts which use a custom bootloader. I
> could have written it in VBScript, except that VBScript
> doesn't do COM ports.

I think VBscript (if that's the same as Visual Basic) should be able to
talk to DLL libraries. And it's possible to compile C code into a DLL
library.

So you can do the bulk of the programming in the easy language and only
use C when essential.


--
bartc

supe...@casperkitty.com

unread,
May 18, 2018, 6:24:21 PM5/18/18
to
On Friday, May 18, 2018 at 5:12:56 PM UTC-5, Melzzzzz wrote:
> On 2018-05-18, supe...@casperkitty.com <supe...@casperkitty.com> wrote:
> > Those factors pretty much killed off Pascal even though it had a number of
> > features that good languages should have but C doesn't, such as a proper
> > pass-by-reference facility.
>
> Common, pass by reference ;)
> C in his spirit does not needs that.

And as a consequence, given something like:

void test(void)
{
    int x;

    getValue(&x);
    do
        doSomethingWith(x);
    while (--x);
}

a compiler that can't see inside getValue() or doSomethingWith() must
store x to memory before each call to doSomethingWith and reload it
after, even on platforms which would guarantee that some registers
will remain undisturbed across subroutine calls.

In a language with proper pass-by-reference semantics, the fact that
getValue() had received x by reference, rather than receiving a pointer to
x, would allow the compiler to assume that anything that was going to be
done with that reference would be done before the function returned.

bartc

unread,
May 18, 2018, 8:32:48 PM5/18/18
to
On 18/05/2018 23:10, supe...@casperkitty.com wrote:
> On Friday, May 18, 2018 at 4:33:13 PM UTC-5, Peabody wrote:
>> Where is Pascal when we really need it? :-)
>
> It got displaced by C for a couple of reasons:
>
> 1. While many Pascal implementations offered enough features to make
> the language usable for practical purposes, they did so inconsistently,
> because nothing in the design of the language suggested how they should
> work. The design of the C language, by contrast, often offered some
> strong clues.
>
> For example, in Turbo Pascal, the syntax to write 0x1A01 to the 32 bits
> at segment 0xB800 offset 160 is "MemW[$B800:160] := $1A01;", but there
> is nothing in Pascal itself that would suggest the identifier MemW, nor
> the use of two numbers separated by a colon, as a means of performing
> such an access. In an 8088 implementations of C configured to use 32-bit
> pointers by default, the syntax is "*((unsigned*)0xB80000A0) = 0x1A01;".
> The design and historical of the language suggested that conversion
> from an integer type to a same-size pointer should be representation-
> preserving, and that 8088 implementations should use 16-bit "int".

The OP apparently doesn't like C's syntax.

What's hard to understand is why it is not possible to have a language that
does exactly what C does, but with an alternative syntax. There are a
few different ones of which C's terse syntax with braces and lots of
punctuation is one.

(Although I still doubt it would endear itself to me as there are other
matters besides syntax.)

--
bartc

Melzzzzz

unread,
May 18, 2018, 8:43:08 PM5/18/18
to
You have nimi, it generates C code and has different syntax ;)

Keith Thompson

unread,
May 18, 2018, 10:12:43 PM5/18/18
to
Melzzzzz <Melz...@zzzzz.com> writes:
> On 2018-05-19, bartc <b...@freeuk.com> wrote:
[...]
>> The OP apparently doesn't like C's syntax.
>>
>> What's hard to understand is why it is not possible have a language that
>> does exactly what C does, but with an alternative syntax. There are a
>> few different ones of which C's terse syntax with braces and lots of
>> punctuation is one.
>>
>> (Although I still doubt it would endear itself to me as there are other
>> matters besides syntax.)
>>
> You have nimi, it generates C code and has different syntax ;)

Do you mean Nim?

Melzzzzz

unread,
May 18, 2018, 10:34:13 PM5/18/18
to
On 2018-05-19, Keith Thompson <ks...@mib.org> wrote:
> Melzzzzz <Melz...@zzzzz.com> writes:
>> On 2018-05-19, bartc <b...@freeuk.com> wrote:
> [...]
>>> The OP apparently doesn't like C's syntax.
>>>
>>> What's hard to understand is why it is not possible have a language that
>>> does exactly what C does, but with an alternative syntax. There are a
>>> few different ones of which C's terse syntax with braces and lots of
>>> punctuation is one.
>>>
>>> (Although I still doubt it would endear itself to me as there are other
>>> matters besides syntax.)
>>>
>> You have nimi, it generates C code and has different syntax ;)
>
> Do you mean Nim?

Yes, Nim. It was a typo ;p
Nim has advantage that you can compile into C and then carry C code
where it is needed.

Tim Rentsch

unread,
May 19, 2018, 12:32:35 AM5/19/18
to
Keith Thompson <ks...@mib.org> writes:

> bartc <b...@freeuk.com> writes:
> [...]
>
>> If it needs to be unsigned but 'unsigned char' is too long-winded, then
>> try this:
>>
>> typedef unsigned char byte;
>>
>> byte memory[500000000];
>>
>> Just use 'byte' (or whatever short name you choose) in place of
>> 'unsigned char'.
>
> "unsigned char" is not too long-winded. [...]

Obviously some people don't agree with that opinion.

Tim Rentsch

unread,
May 19, 2018, 2:52:33 AM5/19/18
to
bartc <b...@freeuk.com> writes:

> On 18/05/2018 23:10, supe...@casperkitty.com wrote:
>
>> On Friday, May 18, 2018 at 4:33:13 PM UTC-5, Peabody wrote:
>>
>>> Where is Pascal when we really need it? :-)
>>
>> It got displaced by C for a couple of reasons:
>>
>> 1. While many Pascal implementations offered enough features to make
>> the language usable for practical purposes, they did so inconsistently,
>> because nothing in the design of the language suggested how they should
>> work. The design of the C language, by contrast, often offered some
>> strong clues.
>>
>> For example, in Turbo Pascal, the syntax to write 0x1A01 to the 32 bits
>> at segment 0xB800 offset 160 is "MemW[$B800:160] := $1A01;", but there
>> is nothing in Pascal itself that would suggest the identifier MemW, nor
>> the use of two numbers separated by a colon, as a means of performing
>> such an access. In an 8088 implementations of C configured to use 32-bit
>> pointers by default, the syntax is "*((unsigned*)0xB80000A0) = 0x1A01;".
>> The design and historical of the language suggested that conversion
>> from an integer type to a same-size pointer should be representation-
>> preserving, and that 8088 implementations should use 16-bit "int".
>
> The OP apparently doesn't like C's syntax.
>
> What's hard to understand is why it is not possible have a language
> that does exactly what C does, but with an alternative syntax.

It isn't hard to understand at all. It certainly is possible
to do, but no one has done it in a way that you like, because
your ideas are so screwy.

bartc

unread,
May 19, 2018, 7:15:14 AM5/19/18
to
On 19/05/2018 07:52, Tim Rentsch wrote:
> bartc <b...@freeuk.com> writes:

>> The OP apparently doesn't like C's syntax.
>>
>> What's hard to understand is why it is not possible have a language
>> that does exactly what C does, but with an alternative syntax.
>
> It isn't hard to understand at all. It certainly is possible
> to do, but no one has done it in a way that you like, because
> your ideas are so screwy.

Really? There might be reasons why so many 'easy' languages eschew
C-style syntax and go for something which is easier on the eye (Python,
Ruby, Lua, ...)

Think how much more productive programmers could be, how much less
error-prone their code could be, how much more enjoyable their work
might be. But no, the idea is too preposterous...




David Brown

unread,
May 19, 2018, 12:01:02 PM5/19/18
to
On 18/05/18 23:33, Peabody wrote:
> David Brown says...
>
> > And don't write the PC program in C, unless you are a very
> > experienced C programmer and know no other programming
> > languages. C (on a PC) has no problem with large lumps
> > of data, statically or dynamically allocated. But it is
> > a hopeless language for something like parsing Intel Hex
> > files or other string manipulation. You'll spend far
> > more time trying to figure out your buffers, memory
> > allocations, data structures, etc., than actually
> > solving the problem. Use a language that has direct
> > support for strings, lists, pattern matching, etc. It
> > will be a fraction of the development effort.
>
> Too late. Well, I'm obviously not experienced in C. But
> all my programming experience in the past has been with
> assembler - for 6502, for x86(DOS), and MSP430. That's
> if you don't count Fortran and Basic. But last year I
> needed to modify the source of a TI program called BSLDEMO,
> which is the Windows side of some MSP430 parts' BSL setup.
> So I had to learn enough about C to do that and recompile.
>

There is plenty of Python code for working with the BSL too (it is the
main language used by the open source msp430 development tools, which
are far better than the stuff TI wrote themselves). But as you say, it
is too late now - and anyway, I doubt if you want to learn yet another
language at the moment.

> Then I had to write my own program to do the BSL thing for
> certain other MSP430 parts which use a custom bootloader. I
> could have written it in VBScript, except that VBScript
> doesn't do COM ports. So really the only option I had was
> C. And I had some very helpful ANs from Silabs about
> talking to COM ports in C that made it possible.
>
> It's not pretty code, but I managed to parse not only Intel
> Hex but also TI-TEXT files, then send the binary to the MCU.
> But those were tiny files, unlike the ones for STM32F's
> here.

Well, as has been pointed out, C doesn't care about the sizes here - you
can happily use 2 MB arrays in C on a PC without a second thought. (It
would be different in C on the msp430 or the STM32F, of course.)

>
> But I have to agree that the logic of the task at hand was
> not the big stumbling block. It was the C notation and
> jargon. And it's all so terse that it's not clear what's
> going on.
>
> Where is Pascal when we really need it? :-)
>

Delphi is alive and well for Windows. And you can work with comms ports
from Delphi without problem. I don't think it is significantly better
than C for this sort of thing, but of course that is a matter of
familiarity. And if you want a nice GUI, Delphi is a very
convenient way to make one.

David Brown

unread,
May 19, 2018, 12:04:48 PM5/19/18
to
On 19/05/18 00:10, supe...@casperkitty.com wrote:
> On Friday, May 18, 2018 at 4:33:13 PM UTC-5, Peabody wrote:
>> Where is Pascal when we really need it? :-)
>
> It got displaced by C for a couple of reasons:
>

>
> Those factors pretty much killed off Pascal even though it had a number of
> features that good languages should have but C doesn't, such as a proper
> pass-by-reference facility.
>

Of all the things which Pascal has but which C does not,
pass-by-reference (rather than passing a pointer in C) is not one that
would spring to mind. I'd rather pick the good enumerated types, the
ranged integer types, arrays using types or ranges for indices, and some
of the syntax like "with" blocks. (There are many things that C does
better than Pascal, of course.)

Richard Bos

unread,
May 19, 2018, 12:12:05 PM5/19/18
to
David Brown <david...@hesbynett.no> wrote:

> Of all the things which Pascal has but which C does not,
> pass-by-reference (rather than passing a pointer in C) is not one that
> would spring to mind. I'd rather pick the good enumerated types, the
> ranged integer types, arrays using types or ranges for indices, and some
> of the syntax like "with" blocks. (There are many things that C does
> better than Pascal, of course.)

Arrays using ranges for indices require run-time checking. Not something
I'd need to see added to C. Ditto for ranged integer types in general.
The only way you could introduce both to C is to make them compile-time
checkable (and making a provable constant assignment, e.g., a constraint
violation), but leave run-time overflows UB. I'm not sure that would be
a sufficient improvement to warrant the addition.

As for with... yeuch. I can see the attraction, but I _have_ seen the
mess it makes far too often.

Better enumerated types, though... yes. They'd only be useable if you
cannot assign anything to them which isn't the same type, though. So no
cross-assigning integers and enums.

Richard

David Brown

unread,
May 19, 2018, 12:32:45 PM5/19/18
to
On 19/05/18 18:11, Richard Bos wrote:
> David Brown <david...@hesbynett.no> wrote:
>
>> Of all the things which Pascal has but which C does not,
>> pass-by-reference (rather than passing a pointer in C) is not one that
>> would spring to mind. I'd rather pick the good enumerated types, the
>> ranged integer types, arrays using types or ranges for indices, and some
>> of the syntax like "with" blocks. (There are many things that C does
>> better than Pascal, of course.)
>
> Arrays using ranges for indices require run-time checking. Not something
> I'd need to see added to C. Ditto for ranged integer types in general.

No, they don't require run-time checking. They only require run-time
checking if you want run-time checking - which is costly (and I agree
not something I would usually want in C or Pascal), but can be very
helpful in debugging and testing.

> The only way you could introduce both to C is to make them compile-time
> checkable (and making a provable constant assignment, e.g., a constraint
> violation), but leave run-time overflows UB. I'm not sure that would be
> a sufficient improvement to warrant the addition.

In Pascal, you can write something like:

type
    days_of_week = (Sunday, Monday, Tuesday, Wednesday, Thursday,
                    Friday, Saturday);
    weekdays = Monday .. Friday;

    workday_hours = array[weekdays] of 0 .. 24;

var
    my_workday_hours : workday_hours;

Accessing elements of "my_workday_hours" is /exactly/ as efficient as
accessing an array in C. Doing so via a pointer is marginally less so,
because it requires an offset (since the index does not start at 0) that
probably can't be optimised away at compile time.

Checking for ranges can often be done at compile time - thus
"my_workday_hours[Sunday]" will give a compile-time error. Whether
ranges are checked at run-time, or are undefined behaviour, is a matter
of implementation flags. Pascal compilers usually support both choices.

In C, for comparison, you don't have a way to express this sort of
thing, and don't have a way to get compile-time checking of errors that
could be checked at compile-time. (Some compilers can do so to some
extent.)


>
> As for with... yeuch. I can see the attraction, but I _have_ seen the
> mess it makes far too often.

It certainly has its pros and cons, and is open to abuse as much as use.
Having a nice feature is no guarantee that people will use it to write
nice code.

>
> Better enumerated types, though... yes. They'd only be useable if you
> cannot assign anything to them which isn't the same type, though. So no
> cross-assigning integers and enums.
>

That is the case for Pascal enumerated types. They are independent
types, rather than just a collection of int-compatible constants.

> Richard
>

bartc

unread,
May 19, 2018, 2:03:15 PM5/19/18
to
On 19/05/2018 17:32, David Brown wrote:

> In Pascal, you can write something like:
>
> type
>     days_of_week = (Sunday, Monday, Tuesday, Wednesday, Thursday,
> Friday, Saturday);
>     weekdays = Monday ... Friday;
>
>     workday_hours = array[weekdays] of 0 .. 24;
>
> var
>     my_workday_hours : workday_hours;
>
> Accessing elements of "my_workday_hours" is /exactly/ as efficient as
> accessing an array in C.  Doing so via a pointer is marginally less so,
> because it requires an offset (since the index does not start at 0) that
> probably can't be optimised away at compile time.

In both cases (direct array and via a pointer), at worst there will be a
constant offset when the lower bound is not zero, but that can usually
be incorporated into the address mode of the instruction that accesses
the element (depends on instruction set).

And a compiler could eliminate that by having the address of the array
pointing to an imaginary 0th element rather than the first element.

--
bartc

supe...@casperkitty.com

unread,
May 19, 2018, 2:17:36 PM5/19/18
to
On Saturday, May 19, 2018 at 11:32:45 AM UTC-5, David Brown wrote:
> On 19/05/18 18:11, Richard Bos wrote:
> > Arrays using ranges for indices require run-time checking. Not something
> > I'd need to see added to C. Ditto for ranged integer types in general.
>
> No, the don't require run-time checking. They only require run-time
> checking if you want run-time checking - which is costly (and I agree
> not something I would usually want in C or Pascal), but can be very
> helpful in debugging and testing.

If a program will receive input from potentially-untrustworthy sources,
and if abnormal termination would be an acceptable consequence when given
invalid input, optimal code that relies upon run-time array checking may
be more efficient than optimal code with "manual" range checking, since
a compiler generating the former could adjust such code to fit other
optimizations like loop unrolling. Further, unless a program is speed
critical, the extra security benefits of performing range checks on all
values that might be accessed from outside threads ("legally" or not)
may justify the cost, especially if a compiler--unlike a programmer--can
recognize that something like:

arr[x]++;
...
arr[x]++;

where there are no apparent intervening operations on x, but its address
would be exposed to other threads, may be treated as either:

temp = x; /* Address of 'temp' not exposed to outside code */
boundscheck(temp);
arr[temp]++;
...
temp = x;
boundscheck(temp);
arr[temp]++;

or

temp = x; /* Address of 'temp' not exposed to outside code */
boundscheck(temp);
arr[temp]++;
... no intervening operations on temp
arr[temp]++;

at the compiler's convenience. Someone writing code manually would have
no way of knowing whether the cost of keeping 'temp' in the intervening
code would be more or less than the cost of re-loading and re-checking
the value.

> > The only way you could introduce both to C is to make them compile-time
> > checkable (and making a provable constant assignment, e.g., a constraint
> > violation), but leave run-time overflows UB. I'm not sure that would be
> > a sufficient improvement to warrant the addition.
>
> In Pascal, you can write something like:
>
> type
> days_of_week = (Sunday, Monday, Tuesday, Wednesday, Thursday, Friday,
> Saturday);
> weekdays = Monday ... Friday;
>
> workday_hours = array[weekdays] of 0 .. 24;
>
> var
> my_workday_hours : workday_hours;
>
> Accessing elements of "my_workday_hours" is /exactly/ as efficient as
> accessing an array in C. Doing so via a pointer is marginally less so,
> because it requires an offset (since the index does not start at 0) that
> probably can't be optimised away at compile time.

Medium-complexity compilers would probably have a better shot at
optimizing accesses to a non-zero-based array than they would with code
in C that adjusts indices on every access. The fact that an array isn't
zero-based would give a strong hint to a compiler that, in contexts where
the address of the pointer itself isn't exposed to outside code, it
would likely benefit from using an adjusted pointer. A sufficiently
sophisticated C compiler might happen to notice that array "foo" is
always accessed with a subscript of "i-42", but only if it spends a fair
amount of time looking for a lot of similar potential optimizations that
don't pan out.

> Checking for ranges can often be done at compile time - thus
> "my_workday_hours[Sunday]" will give a compile-time error. Whether
> ranges are checked at run-time, or are undefined behaviour, is a matter
> of implementation flags. Pascal compilers usually support both choices.
>
> In C, for comparison, you don't have a way to express this sort of
> thing, and don't have a way to get compile-time checking of errors that
> could be checked at compile-time. (Some compilers can do so to some
> extent.)

> > Better enumerated types, though... yes. They'd only be useable if you
> > cannot assign anything to them which isn't the same type, though. So no
> > cross-assigning integers and enums.
> >
>
> That is the case for Pascal enumerated types. They are independent
> types, rather than just a collection of int-compatible constants.

A useful common extension is to allow conversions between enumerated
types and integer types using the syntax newType(valueToBeConverted).
Whether a compiler would trap an attempt to compute weekdays(0) or
weekdays(123) would typically be an implementation setting.

BTW, there are two different kinds of unsigned semantics which are
useful for different purposes; both Pascal and C would benefit from
having explicit types of each kind. Unsigned types are sometimes
used to represent numbers, and sometimes to represent members of an
abstract algebraic ring of integers congruent mod 2**N. Having
values that can be treated as a ring is useful, but it precludes the
possibility of useful overflow/bounds checks. If a 16-bit value is
being used as an algebraic ring (e.g. for checksum computation),
having to jump through hoops to avoid overflow trapping is not helpful.
On the other hand, if it's being used to store a quantity, trapping
on an attempt to store a value outside the range 0-65535 may be very
useful. If I were designing a language, I'd include both kinds of
values.

David Brown

unread,
May 20, 2018, 6:09:16 AM5/20/18
to
On 19/05/18 20:03, bartc wrote:
> On 19/05/2018 17:32, David Brown wrote:
>
>> In Pascal, you can write something like:
>>
>> type
>>      days_of_week = (Sunday, Monday, Tuesday, Wednesday, Thursday,
>> Friday, Saturday);
>>      weekdays = Monday ... Friday;
>>
>>      workday_hours = array[weekdays] of 0 .. 24;
>>
>> var
>>      my_workday_hours : workday_hours;
>>
>> Accessing elements of "my_workday_hours" is /exactly/ as efficient as
>> accessing an array in C.  Doing so via a pointer is marginally less
>> so, because it requires an offset (since the index does not start at
>> 0) that probably can't be optimised away at compile time.
>
> In both cases (direct array and via a pointer), at worst there will be a
> constant offset when the lower bound is not zero, but that can usually
> be incorporated into the address mode of the instruction that accesses
> the element (depends on instruction set).
>

Yes. Such arrays may have a slight extra cost compared to 0-based
arrays, but it is at most a very slight extra cost.

> And a compiler could eliminate that by having the address of the array
> pointing to an imaginary 0th element rather than the first element.
>

Possibly - that may involve other complications since the pointer no
longer points at the object itself. Certainly it is a conceivable
optimisation, and this is all a matter of implementation details.

Spiros Bousbouras

unread,
May 20, 2018, 3:05:35 PM5/20/18
to
On Fri, 18 May 2018 10:28:38 +0200
David Brown <david...@hesbynett.no> wrote:
> On 18/05/18 01:36, Peabody wrote:
> > David Brown says...
> >
> > > Just to be clear here, are you writing the C code for
> > > the microcontroller or for the PC?
> >
> > I'm doing a demonstration program for the PC - both an
> > encryption program and a decryption program. Just to make
> > sure I understand the algorithm properly.

[...]

> And don't write the PC program in C, unless are a very experienced C
> programmer and know no other programming languages. C (on a PC) has no
> problem with large lumps of data, statically or dynamically allocated.
> But it is a hopeless language for something like parsing Intel Hex files
> or other string manipulation. You'll spend far more time trying to
> figure out your buffers, memory allocations, data structures, etc., than
> actually solving the problem. Use a language that has direct support
> for strings, lists, pattern matching, etc. It will be a fraction of the
> development effort.

I very much disagree with this. I had a look at en.wikipedia.org/wiki/Intel_HEX
and the format is easy to parse; it would make a pleasant exercise. The OP
stated in <20180518-2...@Peabody.us.newsgroupdirect.com> that he has
experience with several other languages, so this particular project would be an
excellent way to get more experience in C. Now if someone has limited programming
experience *in general* then yes, this would perhaps be too ambitious.
There are already libraries in C for "strings, lists, pattern matching", but
implementing any of those on one's own is a rewarding exercise. And I'm sure
there are libraries for parsing the HEX format; the wikipedia article gives a
link to one.

In particular, the statement that C "is a hopeless language for" [..] "string
manipulation" is absurd. The standard library is not much good, but there are
plenty of other libraries and, again, writing one's own is a pleasant and
instructive exercise. One can experiment with different data structures,
methods of allocation, etc. Not many languages provide this level of fine
control, and writing code while having that level of fine control is a
satisfying experience. Perhaps it takes a certain kind of brain to find this
satisfying, but then we don't know whether the OP has this kind of brain.
He has said that he has experience with assembler, and that is close to C.

> Usually you shouldn't even bother with hex file parsing - just use
> objcopy, which will be part of the development tools for the
> microcontroller. It will convert your file to a nice raw binary image.

--
These whole systems of dark planets, those trillions of square kilometres of blank
paper, represented the Mind's future; the spaces it would fill in its life to come.
If it had one.
"Consider Phlebas"

Malcolm McLean

unread,
May 21, 2018, 6:00:06 AM5/21/18
to
On Sunday, May 20, 2018 at 8:05:35 PM UTC+1, Spiros Bousbouras wrote:
>
> In particular , the statement that C "is a hopeless language for" [..] "string
> manipulation" is absurd.
>
When you work with DNA data, strings become very long, and the simple algorithms
for searching and matching become far too slow.
In fact it can be other languages which are hopeless for implementing
something like a suffix tree. C can do it reasonably well and with space
efficiency. I haven't tried it in, say, Perl or Matlab, but I think you'd
resort to linking a C subroutine.

David Brown

unread,
May 21, 2018, 8:53:43 AM5/21/18
to
My understanding is that he is using C because he has no choice of
language for handling a required task - not because he wants to learn C.

For someone wanting to /learn/ general C programming, I agree that an
Intel Hex format parser might be a fine project. But if you are looking
at it from the other viewpoint - you need an Intel Hex format parser and
want to pick a language, then C would be a crazy choice (ignoring any
other of a myriad of reasons for or against choosing any particular
language).

> Now if someone has limited programming
> experience *in general* then yes , this would be perhaps too ambitious.
> There are already libraries in C for "strings, lists, pattern matching" but
> implementing any of those on one's own is a rewarding exercise. And I'm sure
> there are libraries for parsing the HEX format ; the wikipedia article gives a
> link to one.
>
> In particular , the statement that C "is a hopeless language for" [..] "string
> manipulation" is absurd.

I'll accept that it is an exaggeration. And I agree that there are
libraries to aid you here. But for languages that have a good string
handling along with native lists and other high-level structures, this
is all just a few lines of standard language code. In C, you are
researching libraries, finding out how to use them, etc. C is a
language in which you /can/ do practically anything - that does not mean
it is a good choice of language for all tasks.

supe...@casperkitty.com

unread,
May 21, 2018, 11:12:41 AM5/21/18
to
On Sunday, May 20, 2018 at 2:05:35 PM UTC-5, Spiros Bousbouras wrote:
> In particular , the statement that C "is a hopeless language for" [..] "string
> manipulation" is absurd. The standard library is not much good but there are
> plenty of other libraries and again , writing one's own is a pleasant and
> instructive exercise. One can experiment with different data structures ,
> methods of allocation , etc. Not many languages provide this level of fine
> control and writing the code while having that level of fine control is a
> satisfying experience. Perhaps it takes a certain kind of brain to find this
> satisfying but then we don't know if the OP has or hasn't this kind of brain.
> He has said that he has experience with assembler and that is close to C.

Doing most kinds of string processing without being able to retrieve the
lengths of strings in O(1) time is pretty much hopeless. If one ignores
everything in <string.h> but memmove, memcpy, and maybe memcmp it's
possible to write useful string-processing libraries from scratch, but
on most implementations there's no nice way to make string literals
interact smoothly with other kinds of strings. As of C11, the best
approach I've found, which also works back to C89, would be to create
a macro which takes an identifier name and a string literal, and creates
a static identifier with that name that encodes that string. For
example, one could have a macro

SHORTLIT(name, dat)

which will accept a literal string of up to 63 characters and encode it
with a single-byte length prefix in the first byte, a second macro

MEDLIT(name, dat)

which will accept a string of up to 4095 characters and encode it with a
two-byte length prefix (using some bits of the first byte to indicate that
the prefix is two bytes), other macros for LONG or HUGE strings, and
macros for short, medium, long, or huge length-checked buffers that would
then be initialized with an INITSTRING() macro prior to use (the macro
would store the buffer size in the header).

Unfortunately, while it would be far more convenient to say:

concatLiteral(mybuff, "This is my message");
or
concats(mybuff, SHORTLITVALUE("This is my message"));

than to say

STR64(thisIsMyMessage, "This is my message");
...
concats(mybuff, thisIsMyMessage);

the lack of static compound literals in C means that there's no nice way to
avoid having to define separately-named objects for every literal string.

Malcolm McLean

unread,
May 21, 2018, 11:27:25 AM5/21/18
to
On Monday, May 21, 2018 at 4:12:41 PM UTC+1, supe...@casperkitty.com wrote:
>
> Doing most kinds of string processing without being able to retrieve the
> lengths of strings in O(1) time is pretty much hopeless. If one ignores
> everything in <string.h> but memmove, memcpy, and maybe memcmp it's
> possible to write useful string-processing libraries from scratch, but
> on most implementations there's no nice way to make string literals
> interact smoothly with other kinds of strings.
>
Everyone at some stage says "I can do better than C's asciiz strings"
and writes

typedef struct
{
    char *data;
    size_t length;
} String;

In reality adding a length parameter doesn't buy you much. Theoretically
it transforms a lot of operations from O(N) to O(constant), at least
in one argument. But you've also got to look at the factor, simply
stepping over a char is a very fast operation. And most strings in most
applications are short. And mostly you are applying only a few operations
to each string.

Really what matters is higher-level operations, not shaving time off
reimplementations of strcpy() and strcat().

Then, as you say, being able to pass a literal as a string argument
can be a huge advantage.

bartc

unread,
May 21, 2018, 11:59:41 AM5/21/18
to
Some libraries can be useful. The problem is that, when posting short
pieces of code in a forum like this, the simplest bit of string
processing using the library will require that library, either added to
the code or via a link.

Then using only the standard string functions is handy.

But when you do use a higher level library, then using counted strings
/is/ useful. For one thing, it allows you to refer to a substring of
another string, something not possible when strings must be zero-terminated.

It also allows you to have embedded zeros within a string, so that a
string object can represent the contents of any file, even an executable
(I use this feature actually), or just any memory block.

And when the strings /can/ be long, then operations can be far more
efficient:

------------------------------------
#include <stdio.h>
#include <string.h>

char string[1000001];

int main(void) {
    int i;
    string[0] = 0;

    for (i = 0; i < 1000000; ++i) {
        strcat(string, "A");
    }

    printf("strlen(string) = %zu\n", strlen(string));
}
------------------------------------

This took 95 seconds using gcc -O3.

Using an interpreted version (see sig), and choosing a non-optimised
100% HLL version of an interpreter expressed in C and compiled with Tiny
C, it took 0.08 seconds, and half of that is process startup/terminate
overheads.

Needless to say, that version uses counted strings. It also has to
allocate space as it goes along, i.e. have an expanding buffer, something
I haven't bothered the C version with as it's got enough to do.

--
bartc

proc start=
s:=""
to 1 million do
s +:= ' '
od

println s.len
end

Malcolm McLean

unread,
May 21, 2018, 12:16:51 PM5/21/18
to
On Monday, May 21, 2018 at 4:59:41 PM UTC+1, Bart wrote:
>
> And when the strings /can/ be long, then operations can be far more
> efficient:
>
> ------------------------------------
> #include <stdio.h>
> #include <string.h>
>
> char string[1000001];
>
> int main(void) {
> int i;
> string[0]=0;
>
> for (i=0; i<1000000; ++i) {
> strcat(string, "A");
> }
>
> printf("strlen(string) = %d\n",strlen(string));
> }
> ------------------------------------
>
> This took 95 seconds using gcc-O3.
>
> Using an interpreted version (see sig), and choosing a non-optimised
> 100% HLL version of an interpreter expressed in C and compiled with Tiny
> C, it took 0.08 seconds, and half of that is process startup/terminate
> overheads.
>
Yes, but if you're pushing single characters onto the back of a string,
it's just as easy to do

for(i=0;i<1000000;i++)
string[i] = 'A';

string[i] = 0;

A higher-level language must allow a "reserve" function to avoid O(N)
reallocations, and the caller must use it; even then it will be about a
factor of ten slower, because the length field is updated and checked
against capacity on every call to the concatenate function.

supe...@casperkitty.com

unread,
May 21, 2018, 12:40:16 PM5/21/18
to
The performance of strcpy() isn't the problem, since that function is
going to have to look at every byte of the source string whether or not
the length is known in advance.

Bigger problems are things like substring extraction (which requires
ensuring that there aren't any zero bytes in the space before the part
of the string of interest), and the lack of any practical means of
providing buffer safety or smooth buffer management unless one manually
keeps track of the length of everything.

A good general-purpose string library could accommodate a number of string
formats, identifiable via prefix byte, and supply a couple functions which
given a pointer to a string's header, could fill in one of the following
structures:

struct readableString {
    uint8_t typeMarker[1];
    char *dat;
    size_t length;
};

or

struct writableString {
    uint8_t typeMarker[1];
    char *dat;
    size_t length;
    RESIZER_INFO *block_control;
};

(the choice being based upon whether code would be writing to the string).
Functions which expect to read from a string would be able to accept
either of the above, a length-prefixed constant string, or a length-prefixed
fixed-size buffer. Those that write to a string could either accept a
length-prefixed writable buffer or a struct writableString.

Such a design would work well even in the one situation where asciiZ
strings would have a significant performance advantage over other length-
prefixed forms: passing the tail of a string. Unlike asciiz strings,
however, the design would not be limited to passing just the tail. Any
substring could be passed to any function expecting a prefixed string
by constructing a readableString struct where the dat pointer and length
identify the content of interest and passing the address of that struct.
The header byte would identify the object as a readableString, so any
function that uses the "convert prefixed string to readableString" function
would be able to use it just like any other readable string.

The only downsides to functions that use such strings would be that they
would not be able to accept pointers to asciiz strings without first
converting them to a readableString struct, a process which would in turn
require strlen(), and that buffers which would be passed as "destination"
strings would need to have their headers initialized first (so the code
receiving a pointer to the space would know how big it was). Unfortunately
there's no way to initialize just part of a structure in a declaration.
If code (likely via macro) were to declare an automatic object like:

struct { uint8_t header[2]; char dat[2050];} myBuff[1] =
{ 4, 128}; // Indicates empty buffer of size 2048

a compiler would be required to zero out all 2050 bytes of "dat" even if
none of their initial values would ever be observed. If the macro would
instead generate:

struct { uint8_t header[2]; char dat[2050];} myBuff[1];
myBuff[0].header[0] = 4; myBuff[0].header[1] = 128;

it would be unsuitable for objects of static duration.

Such downsides aren't trivial, but some slight language improvements could
eliminate them.

bartc

unread,
May 21, 2018, 1:27:57 PM5/21/18
to
On 21/05/2018 17:16, Malcolm McLean wrote:
> On Monday, May 21, 2018 at 4:59:41 PM UTC+1, Bart wrote:
>>
>> And when the strings /can/ be long, then operations can be far more
>> efficient:
>>
>> ------------------------------------
>> #include <stdio.h>
>> #include <string.h>
>>
>> char string[1000001];
>>
>> int main(void) {
>> int i;
>> string[0]=0;
>>
>> for (i=0; i<1000000; ++i) {
>> strcat(string, "A");
>> }
>>
>> printf("strlen(string) = %d\n",strlen(string));
>> }
>> ------------------------------------
>>
>> This took 95 seconds using gcc-O3.
>>
>> Using an interpreted version (see sig), and choosing a non-optimised
>> 100% HLL version of an interpreter expressed in C and compiled with Tiny
>> C, it took 0.08 seconds, and half of that is process startup/terminate
>> overheads.
>>
> Yes, but if you're pushing single characters onto the back of a string,
> it's just as easy to do
>
> for(i=0;i<1000000;i++)
> string[i] = 'A';


Real uses won't fit into such a tidy loop, where you know the final
length of the string among other things, and always know the current
length (otherwise this could be done with memset()).

Imagine needing to implement this function:

void appendchar(char* s, char c) { // assume adequate capacity
...
}

without any other information about s. The simplest implementation
involves scanning the string from the beginning.

> A higher level language must allow a "reserve" function to avoid O(N)
> reallocations, caller must use it, and it will be about a factor of ten
> slower because the length field is updated and checked against capacity
> on every call to the concatenate function.

Ten times slower than what?

My dynamic example does exactly that, yet was 10,000 times faster than
optimised C code (using my usual interpreter), despite the language used
normally being 20 times slower.

Counted strings work!

--
bartc

supe...@casperkitty.com

unread,
May 21, 2018, 1:57:33 PM5/21/18
to
On Monday, May 21, 2018 at 12:27:57 PM UTC-5, Bart wrote:
> Real uses won't fit into that such a tidy loop, where you know the final
> length of the string among other things, and always know the current
> length (otherwise this could be done with memset().)
>
> Imagine needing to implement this function:
>
> void appendchar(char* s, char c) { // assume adequate capacity
> ...
> }
>
> without any other information about s. The simplest implementation
> involves scanning the string from the beginning.

Or, slightly more generally, a function that will need to append some
amount of text not known in advance, to a buffer which might or might
not be large enough or resizable, and which must fail in deterministic
fashion if the buffer isn't large enough and isn't resizable.

Using the kind of prefixed strings I advocate, it would be fairly simple.

// Every string-type structure would have a field "str" of array type.

#define appendFormattedLong(dest, n) appendFormattedLong_((dest)->str, n)

void appendFormattedLong_(STRING *dest, uint64_t n)
{
    char contentToAppend[64];
    int len = sprintf(contentToAppend, "%llu", (unsigned long long)n);

    readableString src = makeReadableString(contentToAppend, len);

    concatString(dest, &src);
}

This function would work equally well with fixed-sized buffers that are
statically allocated or embedded in other structures, or with dynamically-
allocated resizable buffers. All of the complexity would be encapsulated
in makeReadableString (which would simply populate some fields of a
structure and return it), and setStringLength, so functions like the
above wouldn't need to know or care about kinds or sizes of buffers
supplied by client code, and any code which needed to pass around strings
without manipulating the internals itself could just pass around strings.

Scott Lurndal

unread,
May 21, 2018, 2:30:00 PM5/21/18
to
"Equally well" in this case means performs poorly.

Strings stored as length-field + storage have been around
for fifty years or more, and are nothing new. Your proposal
to store the length as variable-length ASCII is, however, new. And a really
bad idea.

A real string has four bounds: Start of storage, end of storage,
start of string within storage, end of string within storage.

See for example the vax MOVC5 or the Burroughs MVS/CPS/HSH instructions.

bartc

unread,
May 21, 2018, 2:32:07 PM5/21/18
to
On 21/05/2018 17:40, supe...@casperkitty.com wrote:

> A good general-purpose string library could accommodate a number of string
> formats, identifiable via prefix byte, and supply a couple functions which
> given a pointer to a string's header, could fill in one of the following
> structures:
>
> struct readableString {
> uint8_t typeMarker[1];
> char *dat;
> size_t length;
> }
>
> or
>
> struct writableString {
> uint8_t typeMarker[1];
> char *dat;
> size_t length;
> RESIZER_INFO *block_control;
> }
>
> (the choice being based upon whether code would be writing to the string).

The choices would be between strings that are always a fixed length (and
which can still be writeable, or mutable, as they usually are in C), and
strings that can grow, or sometimes shrink.

Note that on a 64-bit machine, that prefix byte would occupy an 8-byte
slot, so the first takes 24 bytes and the second 32 bytes; not really
much to choose between them.

(I could create descriptors for both within 16 bytes, even with 64-bit
pointers, but it would limit the length of a single string to 4GB.
Longer strings can just about be accommodated with 16 bytes, but gets
more fiddly and less efficient.)

> Functions which expect to read from a string would be able to accept
> either of the above, a length-prefixed constant string, or a length-prefixed
> fixed-size buffer.

So two kinds of string descriptor, and two kinds of pointers to actual
strings which use a prefix byte(s) inline with the actual data?

That sounds too much, and would make it a headache to write functions
that take such a mix of parameters, as the second two require the string
pointer and the string length to be accessed in a different way.

It might be OK, if this is part of a string type in a dynamic language,
where there other overheads already.

But to make life simpler, I think there should be one kind of descriptor
with .dat and .length always accessed the same way.


--
bartc

supe...@casperkitty.com

unread,
May 21, 2018, 3:08:52 PM5/21/18
to
On Monday, May 21, 2018 at 1:30:00 PM UTC-5, Scott Lurndal wrote:
> "Equally well" in this case means performs poorly.

Better than zero-terminated strings in many cases, and better than other
formats in the only case where zero-terminated strings beats them.

> Strings stored as length-field + storage have been around
> for fifty years or more, and are nothing new. Your proposal
> to store the length as variable length ASCII is, however new. And a really
> bad idea.

What do you mean "ASCII"? It wouldn't be stored as a sequence of digits.
While processing a variable-length encoding is a bit less efficient than
processing a length stored in a single word, it avoids the need to either
set a hard maximum string length or waste a lot of space on small strings
that are packed into structures [if a structure is supposed to hold a 3-
character string, adding a 4-byte length would double the space required].

> A real string has four bounds: Start of storage, end of storage,
> start of string within storage, end of string within storage.

For readable strings, will it matter how long the storage beyond the end
of the string is? For writable strings, will the start of storage matter
in cases where it couldn't be derived from the start of the string?

Approaches where functions that expect strings must always be given
pointers to managed string objects [like the Windows BSTR] can be workable,
but mean that it's not possible for objects to encapsulate string values
within their own bit patterns, even when the amount of space required is
statically known.

bartc

unread,
May 21, 2018, 3:09:16 PM5/21/18
to
On 21/05/2018 18:57, supe...@casperkitty.com wrote:

> Using the kind of prefixed strings I advocate, it would be fairly simple.
>
> // Every string-type structure would have a field "str" of array type.
>
> #define appendFormattedLong(dest, n) appendString_((dest)->str, n)
>
> void appendFormattedLong_(STRING *dest, uint64_t n)
> {
> char contentToAppend[64];
> unsigned len = sprintf(contentToAppend, "%llu", n);
>
> readableString src = makeReadableString(contentToAppend, len);
>
> concatString(dest, &src);
> }
>
> This function would work equally well with fixed-sized buffers that are
> statically allocated or embedded in other structures, or with dynamically-
> allocated resizable buffers. All of the complexity would be encapsulated
> in makeReadableString (which would simply populate some fields of a
> structure and return it), and setStringLength, so functions like the
> above wouldn't need to know or care about kinds or sizes of buffers
> supplied by client code, and any code which needed to pass around strings
> without manipulating the internals itself could just pass around strings.

So that's using the library, which is the problem with C. Everyone uses
their own string library, making it harder to share code.

And for posting small programs here, which happen to make use of some
string ops, it's not really practical to use any kind of library.

--
bartc

supe...@casperkitty.com

unread,
May 21, 2018, 3:56:19 PM5/21/18
to
On Monday, May 21, 2018 at 1:32:07 PM UTC-5, Bart wrote:
> The choices would be between strings that are always a fixed length (and
> which can still be writeable, or mutable, as they usually are in C), and
> strings that can grow, or sometimes reduce.

It would be possible to always use the latter form at the cost of an extra
pointer store, but since string values are read much more often than they
are written, eliminating that store for the cases where code isn't going to
be writing to a string would probably be a useful optimization.

Further, distinguishing the actions of acquiring a read-only descriptor vs.
a writable descriptor would allow more safety-checks (an attempt to pass
a constant string to a function that's going to write it should trap whether
or not the function tries to change the length) and would also allow an
implementation to transparently support a "copy string" function that uses
shared copy-on-write buffers.

> Note that on a 64-bit machine, that prefix byte would occupy an 8-byte
> slot, so the first takes 24 bytes and the second 32 bytes; not really
> much to choose between them.

That plus the fact that acquiring a read descriptor and then attempting to
modify the string should give an error [I meant to include a "const"
qualifier on the "dat" pointer].

> (I could create descriptors for both within 16 bytes, even with 64-bit
> pointers, but it would limit the length of a single string to 4GB.
> Longer strings can just about be accommodated with 16 bytes, but gets
> more fiddly and less efficient.)

The point is that code which wants to hold onto a bunch of short string
values which it isn't actively using shouldn't have to hold onto a bunch
of 16-byte string descriptors, and code which wants to relocate a block
of storage containing a bunch of string buffers that aren't actively
being used should simply be able to move all the bytes of those buffers,
without having to update a whole bunch of string descriptors.

For constant strings up to 64 bytes the overhead would be one byte. For
string buffers up to 64 bytes, it could be one byte if thread-safety is
not required, or else two bytes. Each additional byte of header (or two
bytes if thread-safety is needed) would scale the allowable length by a
factor of 64.

> > Functions which expect to read from a string would be able to accept
> > either of the above, a length-prefixed constant string, or a length-prefixed
> > fixed-size buffer.
>
> So two kinds of string descriptor, and two kinds of pointers to actual
> strings which use a prefix byte(s) inline with the actual data?

Basically. Eliminating one of the kinds of actual strings would increase
the storage requirement for constant strings. Eliminating one kind of
string descriptor would mean losing some protection against certain kinds
of programming mistakes.

> That sounds too much, and would make it a headache to write functions
> that take such a mix of parameters, as the second two require the string
> pointer and the string length to be accessed in a different way.

A function that wants to e.g. output all the bytes of a string would be
something like:

#define outputStringAsBytes(st) outputStringAsBytes_((st)->str)
void outputStringAsBytes_(STRING *p)
{
    readableString str = getReadableString(p);
    for (int i = 0; i < str.length; i++)
        printf("%d\n", str.dat[i]);
}

No need for the function to know or care about what kind of string it's
given. A function to append one character to a string without using the
"concat" method would look something like:

#define appendOneByte(st,ch) appendOneByte_((st)->str, ch)
void appendOneByte_(STRING *p, int ch)
{
    writableString str = getWritableString(p);
    setStringLength(&str, str.length + 1);
    str.dat[str.length - 1] = ch;
}

All of the code that had to worry about whether these functions were being
given a pointer to "prefixed data" strings or string descriptors, would be
in the "getReadableString", "getWritableString", and "setStringLength"
functions. User code could handle all such usage scenarios automatically
without having to know or care about them.

supe...@casperkitty.com

unread,
May 21, 2018, 4:02:38 PM5/21/18
to
On Monday, May 21, 2018 at 2:09:16 PM UTC-5, Bart wrote:
> So that's using the library, which is the problem with C. Everyone uses
> their own string library, making it harder to share code.

Everyone uses their own string library because the one that's defined by
the Standard is grossly unsuitable for most purposes.

Any single library would require making some design trade-offs (e.g. should
it allocate an extra zero byte past the end of each string even when the
library itself would know how big strings are without it), but having
a means of taking a pointer to a string and getting the address and size of
its text, or taking a pointer and length and producing a valid string
descriptor, should make most kinds of data interchange with other kinds
of string libraries fairly straightforward, especially if one is willing to
waste a byte on most strings to accommodate a trailing zero.

bartc

unread,
May 21, 2018, 5:47:23 PM5/21/18
to
One advantage of a counted string is that strings can contain zeros. So,
while not having zero-termination can cause a few problems with
interfacing to functions that expect C-style strings, once you're over
that you can move on.

Here's an example (not C code but a language with proper string types):

s := readstrfile("c:/tdm/bin/gcc.exe")
println s.len
writestrfile("test.exe", s)

system("test.exe --version")

This reads a binary file into a single string, prints the string length,
writes it out again and executes that new file. Output is:

833536
test.exe (tdm64-1) 5.1.0
Copyright (C) 2015 Free Software Foundation, Inc.
....

Actually, with suitable string types and equivalent functions, the same
could be done in C (not sure about managing the memory used by the strings).
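
A C approximation of the read side, using an explicit length so embedded zero
bytes survive (the function name is mine):

```c
#include <stdio.h>
#include <stdlib.h>

// Read an entire (possibly binary) file into a counted buffer.
// Returns the buffer and sets *len; caller frees. NULL on error.
char *read_whole_file(const char *path, size_t *len) {
    FILE *f = fopen(path, "rb");
    if (!f) return NULL;
    fseek(f, 0, SEEK_END);
    long n = ftell(f);
    rewind(f);
    if (n < 0) { fclose(f); return NULL; }
    char *buf = malloc(n > 0 ? (size_t)n : 1);
    if (buf && fread(buf, 1, (size_t)n, f) != (size_t)n) {
        free(buf);
        buf = NULL;
    }
    fclose(f);
    if (buf) *len = (size_t)n;   // length tracked explicitly, zeros and all
    return buf;
}
```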

(BTW here's how such a non-zero-terminated string might be printed via
printf (not C, the printf is a foreign function call):

s := "abcdef"
printf("%.*s\n", s.len, &s)

This however won't work well if the string data is actually binary, and
will stop printing prematurely if there is an embedded zero. But then,
normal C strings could also contain binary data that it would be unwise
to print out.)

--
bartc

Ian Collins

unread,
May 21, 2018, 6:12:18 PM5/21/18
to
On 22/05/18 03:27, Malcolm McLean wrote:
> On Monday, May 21, 2018 at 4:12:41 PM UTC+1, supe...@casperkitty.com wrote:
>>
>> Doing most kinds of string processing without being able to retrieve the
>> lengths of strings in O(1) time is pretty much hopeless. If one ignores
>> everything in <string.h> but memmove, memcpy, and maybe memcmp it's
>> possible to write useful string-processing libraries from scratch, but
>> on most implementations there's no nice way to make string literals
>> interact smoothly with other kinds of strings.
>>
> Everyone at some stage says "I can do better than C's asciiz strings"
> and writes
>
> typedef struct
> {
> char *data;
> size_t length;
> } String;
>
> In reality adding a length parameter doesn't buy you much. Theoretically
> it transforms a lot of operations from O(N) to O(constant), at least
> in one argument. But you've also got to look at the factor, simply
> stepping over a char is a very fast operation. And most strings in most
> applications are short. And mostly you are applying only a few operations
> to each string.

It also brings in overhead, and scope for obscure bugs in maintaining
the correct value in the count.

C's lack of encapsulation makes this kind of "improvement" either unsafe
or reliant on opaque types. Simplicity is C's strongest attribute!

> Really what matters is higher-level operations, not shaving time off
> reimplementations of strcpy() and strcat().

True.

> Then, as you say, being able to pass a literal as a string argument
> can be a huge advantage.

Also true!

--
Ian.

supe...@casperkitty.com

unread,
May 21, 2018, 7:08:21 PM5/21/18
to
On Monday, May 21, 2018 at 5:12:18 PM UTC-5, Ian Collins wrote:
> It also brings in an overhead and scope for obscure bugs in maintaining
> the correct value in the count.
>
> C's lack of encapsulation makes this kind of "improvement" ether unsafe
> or requires the use of opaque types. Simplicity is C's strongest attribute!

Opaque types have many advantages, and their primary disadvantages are
due to deficiencies in the language which could have been corrected fairly
easily (e.g. it should be possible to declare a type of object whose address
may be implicitly converted to a pointer to a different type with which it
shares a common initial sequence, thus allowing a structure to accept
pointers to a wide range of structures without having to forego type-checking
altogether).

Although long C-style strings tend to be rather rare in code that uses the
zero terminator to determine length, that's not because there's no need for
longer strings. Instead it's because any code wanting to manipulate longer
strings will have to use some other representation. While the sets of
built-in operations in Java and .NET have some unfortunate omissions, they
can still handle multi-megabyte strings much more efficiently than C would
be able to handle strings whose length was tracked solely by the location
of the first zero byte.

Scott Lurndal

unread,
May 22, 2018, 9:29:00 AM5/22/18
to
supe...@casperkitty.com writes:
>On Monday, May 21, 2018 at 1:30:00 PM UTC-5, Scott Lurndal wrote:
>> "Equally well" in this case means performs poorly.
>
>Better than zero-terminated strings in many cases, and better than other
>formats in the only case where zero-terminated strings beats them.
>
>> Strings stored as length-field + storage have been around
>> for fifty years or more, and are nothing new. Your proposal
>> to store the length as variable length ASCII is, however new. And a really
>> bad idea.
>
>What do you mean "ASCII"?

The output of the sprintf (deprecated) that you used to format
the length at the beginning of the string, of course (which you
conveniently elided from your reply).

supe...@casperkitty.com

unread,
May 22, 2018, 10:34:34 AM5/22/18
to
Sorry you misunderstood what the sprintf was for. Its purpose was not to
format the length, but rather to supply a variable amount of text. If
one were to do equivalent code using C strings:

void appendFormattedNumber(char *dest, long long n)
{
    char dataToAppend[64];
    sprintf(dataToAppend, "%lld", n);
    strcat(dest, dataToAppend);
}

the caller would have to know in advance how much data the
appendFormattedNumber might add to the string and allocate data for it; if
the function adds more data than the caller expects, the wheels fall off.
Trying to add any kind of length checking to such a function would make it
much more complicated, and totally negate any purpose to using strcat.

My code used the return value from sprintf, which reports the number of
characters formatted, to avoid having to call strlen on the result. Sorry
if the purpose wasn't clear.

John Bode

unread,
May 22, 2018, 12:05:07 PM5/22/18
to
On Wednesday, May 16, 2018 at 4:16:29 PM UTC-5, Peabody wrote:
> Ben Bacarisse says...
>
> > Using malloc is not hard. The only trouble is that you
> > should check the result, but for programs that can get
> > all space they need at the start that's simple.
>
> So you say, but you're fluent in C. For the rest of
> humanity, I suspect malloc suffers from the same issue that
> plagues all normal people.
>
> Of course I'm speaking of pointers.

Pointers really aren't that hard to understand or use. It does take some practice to get
comfortable with them, but they're not the boojums most people make them out to be.

For your case, you only need to take baby steps into pointer world:

#include <stdlib.h>

#define ARRSIZE 50000

int main( void )
{
    char *arr = malloc( sizeof *arr * ARRSIZE );
    if ( arr )
    {
        // arr[i] = some_value, some_thing = arr[j], etc.
        free( arr );
    }
    return 0;
}

For pointer declaration syntax, the basic rules are

// Declare a pointer to T
T *p;

// Declare a pointer to const T - you can update the pointer (set it to point to a different
// object), but not the thing being pointed to:
const T *p;
T const *p;

// Declare a const pointer to T - you can update the thing being pointed to, but
// not the pointer (you can't change it to point to a different object):
T * const p;

// Declare an array of pointers to T
T *ap[N];

// Declare a pointer to an array of T
T (*pa)[M];

// Declare a function returning a pointer to T:
T *f();

// Declare a pointer to a function returning T
T (*f)();

Remember that the * operator is part of the *declarator*, not the type specifier. The
declaration

T* a, b;

is parsed as

T (*a), b;

Only a is declared as a pointer to T.

The array subscript operator [] is defined in terms of pointer arithmetic. a[i] is *defined*
as *(a + i) - given the starting address a, compute the address of the i'th *object* following
a and dereference the result. Thus,

*p == *(p + 0) == p[0]
(*pa)[i] == (*(pa + 0))[i] == (pa[0])[i] == pa[0][i]

Arrays are not pointers, but array *expressions* are converted to pointer expressions as
necessary for subscripting.

Keith Thompson

unread,
May 22, 2018, 12:19:59 PM5/22/18
to
John Bode <jfbod...@gmail.com> writes:
[...]
> // Declare a function returning a pointer to T:
> T *f();
>
> // Declare a pointer to a function returning T
> T (*f)();

Better:

T *f(void);
T (*f)(void);

The empty parentheses indicate an old-style function declaration, an
obsolescent feature.

[...]

--
Keith Thompson (The_Other_Keith) ks...@mib.org <http://www.ghoti.net/~kst>
Working, but not speaking, for JetHead Development, Inc.
"We must do something. This is something. Therefore, we must do this."
-- Antony Jay and Jonathan Lynn, "Yes Minister"

bartc

unread,
May 22, 2018, 2:24:49 PM5/22/18
to
On 22/05/2018 17:04, John Bode wrote:
> On Wednesday, May 16, 2018 at 4:16:29 PM UTC-5, Peabody wrote:
>> Ben Bacarisse says...
>>
>> > Using malloc is not hard. The only trouble is that you
>> > should check the result, but for programs that can get
>> > all space they need at the start that's simple.
>>
>> So you say, but you're fluent in C. For the rest of
>> humanity, I suspect malloc suffers from the same issue that
>> plagues all normal people.
>>
>> Of course I'm speaking of pointers.
>
> Pointers really aren't that hard to understand or use. It does take some practice to get
> comfortable with them, but they're not the boojums most people make them out to be.
>
> For your case, you only need to take baby steps into pointer world:
<snip>

These are some big 'baby' steps (enough to put people off if they think
there's even more).

Simple pointers don't need to involve the use of 'const', or pointers to
arrays, or pointers to functions. (Pointers to arrays are hardly ever
used anyway).

--
bartc

Scott Lurndal

unread,
May 22, 2018, 2:57:22 PM5/22/18
to
r...@zedat.fu-berlin.de (Stefan Ram) writes:
>Ben Bacarisse <ben.u...@bsb.me.uk> writes:
>>Using malloc is not hard.
>
> I expect many programs with more than 1000 LOCs out there
> that from time to time use "malloc" to contain bugs like
> using memory without checking for NULL or not freeing
> allocated memory when it should be freed or freeing a wrong
> pointer or using memory already freed or accessing allocated
> memory beyond its bounds at least along some execution
> paths. And when the average programmer can't do it right,
> then I call it "hard".
>

So, what _do_ you know about the "average programmer"? And
how did you obtain that knowledge?

John Bode

unread,
May 22, 2018, 3:45:13 PM5/22/18
to
On Tuesday, May 22, 2018 at 1:24:49 PM UTC-5, Bart wrote:
> On 22/05/2018 17:04, John Bode wrote:
> > On Wednesday, May 16, 2018 at 4:16:29 PM UTC-5, Peabody wrote:
> >> Ben Bacarisse says...
> >>
> >> > Using malloc is not hard. The only trouble is that you
> >> > should check the result, but for programs that can get
> >> > all space they need at the start that's simple.
> >>
> >> So you say, but you're fluent in C. For the rest of
> >> humanity, I suspect malloc suffers from the same issue that
> >> plagues all normal people.
> >>
> >> Of course I'm speaking of pointers.
> >
> > Pointers really aren't that hard to understand or use. It does take some practice to get
> > comfortable with them, but they're not the boojums most people make them out to be.
> >
> > For your case, you only need to take baby steps into pointer world:
> <snip>
>
> These are some big 'baby' steps (enough to put people off if they think
> there's even more).
>

For the stated problem, yes, all you need is a simple pointer. I added the other stuff
as some (what I hoped would be) useful information.

If you want to program in C, you *have* to understand pointer syntax. *Have* to. It's not
optional. You cannot write useful C code that doesn't use pointers somewhere in some
form, and it's to your advantage to take the time to learn how pointer syntax actually
works, not just for simple pointer types, but for complex pointer-to-array, pointer-to-
function, pointer-to-function-returning-pointer-to-array-of-pointers-to-function types as
well. There's no shortcut around it. It's like trying to learn calculus without internalizing
trig identities.

I had this silly notion that presenting a comprehensive list of examples up front
just might be useful for people learning the concept. Why am I wrong?

If you don't want to deal with pointers *at all*, then don't bother learning C. There are
plenty of languages out there that will probably work just as well for you, maybe even
better depending on what you're trying to do.

> Simple pointers don't need to involve the use of 'const',

Sometimes they do. Sometimes I want to make sure a function doesn't accidentally modify
something through a pointer parameter. Sometimes I want to make sure a pointer can't
accidentally be changed to point to a different object.

> or pointers to arrays, or pointers to functions. (Pointers to arrays are hardly ever
> used anyway).

I use 'em all the damned time because, guess what, *they're useful*. If you're using any
number of GUI frameworks or pthreads, you're using function pointers. If you're passing
multi-dimensional arrays around, you're dealing with pointers to arrays. Not using these
constructs is a bit like never taking your car out of third gear, or only using major and
minor chords in a song. You just limit yourself needlessly.

bartc

unread,
May 22, 2018, 4:02:37 PM5/22/18
to
On 22/05/2018 20:45, John Bode wrote:
> On Tuesday, May 22, 2018 at 1:24:49 PM UTC-5, Bart wrote:

>> Simple pointers don't need to involve the use of 'const',
>
> Sometimes they do.

If you take a working program and remove all the consts, then probably
it will continue to work. So they are an embellishment.

>
> I use 'em all the damned time because, guess what, *they're useful*. If you're using any
> number of GUI frameworks or pthreads, you're using function pointers.

Not in beginner programs. Although if you just need to pass a function
to some library routine, then you probably don't need to actually declare
or use function pointers.

> If you're passing
> multi-dimensional arrays around, you're dealing with pointers to arrays.

That doesn't happen too often either. But even then, the common C idiom
is not to use a pointer to array of T for the first (or only) dimension,
but a pointer to T.

--
bartc

Keith Thompson

unread,
May 22, 2018, 4:22:57 PM5/22/18
to
bartc <b...@freeuk.com> writes:
> On 22/05/2018 20:45, John Bode wrote:
>> On Tuesday, May 22, 2018 at 1:24:49 PM UTC-5, Bart wrote:
>
>>> Simple pointers don't need to involve the use of 'const',
>>
>> Sometimes they do.
>
> If you take a working program and remove all the consts, then probably
> it will continue to work. So they are an embellishment.

If you remove all the consts including those in declarations of library
functions, then yes, it will probably continue to work -- until you
modify it and introduce a bug because the compiler won't catch attempts
to modify objects that should be read-only.

>> I use 'em all the damned time because, guess what, *they're
>> useful*. If you're using any number of GUI frameworks or pthreads,
>> you're using function pointers.
>
> Not in beginner programs. Although if you just need to pass a function
> to some library routine, then probably don't need to actually declare or
> use function pointers.

If you pass a function to some library routine, then you are using a
function pointer. (Strictly speaking, an ordinary function call
involves the use of a function pointer, though you don't really need to
be aware of that.)

>> If you're passing
>> multi-dimensional arrays around, you're dealing with pointers to arrays.
>
> That doesn't happen too often either. But even then, the common C idiom
> is not to use a pointer to array of T for the first (or only) dimension,
> but a pointer to T.

If you're dealing with a true multidimensional array (i.e., an array
of arrays), then you're using pointers to arrays, if only implicitly.
(You might be able to get away with pretending that this isn't
the case.)

luser droog

unread,
May 22, 2018, 7:59:22 PM5/22/18
to
On Tuesday, May 22, 2018 at 2:45:13 PM UTC-5, John Bode wrote:

> If you want to program in C, you *have* to understand pointer syntax. *Have* to. It's not
> optional. You cannot write useful C code that doesn't use pointers somewhere in some
> form, and it's to your advantage to take the time to learn how pointer syntax actually
> works, not just for simple pointer types, but for complex pointer-to-array, pointer-to-
> function, pointer-to-function-returning-pointer-to-array-of-pointers-to-function types as
> well. There's no shortcut around it. It's like trying to learn calculus without internalizing
> trig identities.

Agree. I'm reminded of this:
https://codegolf.stackexchange.com/questions/8727/rpn-calculator-without-pointers

David Brown

unread,
May 23, 2018, 2:53:39 AM5/23/18
to
On 22/05/18 22:02, bartc wrote:
> On 22/05/2018 20:45, John Bode wrote:
>> On Tuesday, May 22, 2018 at 1:24:49 PM UTC-5, Bart wrote:
>
>>> Simple pointers don't need to involve the use of 'const',
>>
>> Sometimes they do.
>
> If you take a working program and remove all the consts, then probably
> it will continue to work. So they are an embellishment.
>

"const" is not a feature to let you do something you could not do
before. It is a feature to /stop/ you doing something you could do before.

For a language (or library, tool, whatever) to support writing correct
programs, there are two primary aims:

1. Make it as easy as possible to write correct code.

2. Make it as hard as possible to write incorrect code.



In comparison to many other languages, C is seen as weak on both these
points - it's a language that gives the developer a lot of freedom, and
demands a lot of responsibility. But these aims are not lost on C, and
each generation of the language since its inception has improved on the
points.

"const" is designed for point 2 here. Yes, you can write a program
without "const" anywhere, and you can remove "const" from any program
without changing its effect ("#define const" at the start of each C file
should even avoid library interfacing issues. Yes, I know this is
undefined behaviour, but I would expect it to work in most cases).

But avoiding "const" makes it easier to have bugs in the code, and it
makes it harder for the programmer and reader to reason about code.

As a side effect, "const" can also improve the efficiency of generated
code in many cases (especially for programmers who understand "static").

The concept of "const" is /so/ important and influential in helping
write clear and correct code that some modern programming languages make
everything "const" by default - you need to explicitly mark data as
"variable" or "mutable".

So a C programmer should learn to appreciate "const" and use it widely,
from their first program. Some use it everywhere they can, even for
small local variables, while others think that is a bit verbose.
Certainly you should always use it for pointers whenever you can.

bartc

unread,
May 23, 2018, 6:09:16 AM5/23/18
to
On 23/05/2018 07:53, David Brown wrote:
> On 22/05/18 22:02, bartc wrote:

>> If you take a working program and remove all the consts, then probably
>> it will continue to work. So they are an embellishment.

> For a language (or library, tool, whatever) to support writing correct
> programs, there are two primary aims:
>
> 1. Make it as easy as possible to write correct code.

> 2. Make it as hard as possible to write incorrect code.

I can't see how it makes it easier. First, you still have to design,
write, develop, and debug your application.

That's enough work by itself. But now you're saying that the extra
effort and headache of adding 'consts' throughout the code and getting
it to still compile (because const-checking issues will propagate all
over the program, even if no actual data structures are going to be
written to), all that is supposed to make it easier?

Some of us are experienced enough that we don't let data structures be
written to when they shouldn't be. (In any case many are too complex to
protect merely by adding 'const'; and if you add too many 'consts', then
you will be stuck trying to figure out how to modify them when they do
need to be updated or created).

> The concept of "const" is /so/ important and influential in helping
> write clear and correct code that some modern programming languages make
> everything "const" by default - you need to explicitly mark data as
> "variable" or "mutable".

Yes, some languages even go so far as to get rid of variables altogether
so that you're left scratching your head as to how to achieve the most
trivial operations.

While other modernish languages (Python for one) take the opposite route
where /everything/ can be modified [by rebinding names], even functions
and modules.

> So a C programmer should learn to appreciate "const" and use it widely,
> from their first program. Some use it everywhere they can, even for
> small local variables, while others think that is a bit verbose.
> Certainly you should always use it for pointers whenever you can.

C's const is a poor substitute for proper control over read-only data
structures. It's too fine-grained, it's easy to get it wrong, it's
possible to still have holes where data structures can be written to.

It sucks at emulating named constants. It gives a false sense of
security. It can be heavily over-used (see some of Stefan Ram's posted
code).

But most importantly it introduces so much clutter that readability can
be affected to the extent that genuine bugs may be harder to spot.

And, it has no place in beginners' code. Many might want to learn C
/because/ it can be dangerous to use!

--
bartc

Malcolm McLean

unread,
May 23, 2018, 6:59:34 AM5/23/18
to
On Tuesday, May 22, 2018 at 8:45:13 PM UTC+1, John Bode wrote:
>
> If you want to program in C, you *have* to understand pointer syntax.
> *Have* to. It's not optional.
>
When C was new it was not unheard of for programmers to be told, "You
can use C as long as you don't use pointers".

Ian Collins

unread,
May 23, 2018, 7:11:35 AM5/23/18
to
On 23/05/18 22:09, bartc wrote:
> On 23/05/2018 07:53, David Brown wrote:
>> On 22/05/18 22:02, bartc wrote:
>
>>> If you take a working program and remove all the consts, then probably
>>> it will continue to work. So they are an embellishment.
>
>> For a language (or library, tool, whatever) to support writing correct
>> programs, there are two primary aims:
>>
>> 1. Make it as easy as possible to write correct code.
>
>> 2. Make it as hard as possible to write incorrect code.
>
> I can't see how it makes it easier. First, you still have to design,
> write, develop, and debug your application.
>
> That's enough work by itself. But now you're saying that the extra
> effort and headache of adding 'consts' throughout the code and getting
> it to still compile...

Competent programmers (even though you might) don't go "adding 'consts'
throughout the code"; we add them where appropriate when writing the code.

--
Ian.

David Brown

unread,
May 23, 2018, 7:21:14 AM5/23/18
to
On 23/05/18 12:09, bartc wrote:
> On 23/05/2018 07:53, David Brown wrote:
>> On 22/05/18 22:02, bartc wrote:
>
>>> If you take a working program and remove all the consts, then probably
>>> it will continue to work. So they are an embellishment.
>
>> For a language (or library, tool, whatever) to support writing correct
>> programs, there are two primary aims:
>>
>> 1. Make it as easy as possible to write correct code.
>
>> 2. Make it as hard as possible to write incorrect code.
>
> I can't see how it makes it easier. First, you still have to design,
> write, develop, and debug your application.

You can't see how /what/ makes /what/ easier? I said "const" makes it
harder to write incorrect code - not that it makes it easier to write
correct code. (I think it does to some extent, by making your
intentions clearer, but the primary point as I see it is that it helps
make some types of bugs into compiler errors.)

Do you understand that my two points above are different?

>
> That's enough work by itself. But now you're saying that the extra
> effort and headache of adding 'consts' throughout the code and getting
> it to still compile (because const-checking issues will propagate all
> over the program, even if no actual data structures are going to be
> written to), all that is supposed to make it easier?

No - read what I wrote.

And if you think "const" is something you go through and add to the code
later, you have /totally/ missed the point.

When you need to define an object (other than dynamically allocated
objects), ask yourself if its value is going to vary or if it is going
to keep the same value throughout its lifetime (program lifetime for
file-scope objects, block lifetime for local objects). If it is going
to keep the same value, define it as a "const". (And if it is
file-scope, you usually also want it to be static, as for most
file-scope objects and functions.)

When you have a pointer to something - either as a variable, or a
parameter - ask yourself if you are going to use that pointer to change
the object(s) pointed at. If not, make it a pointer-to-const.

It is /that/ simple.

Using "const" does not make it easier to write the code in the first
place. But it makes it a very much harder to write code that breaks
your rules here - you can't change the value of objects whose values
should not be changed. You have to go out of your way in order to write
code that even attempts to do so.

(Let me note that I think compilers should be harsher about catching
such attempts, with more warnings or errors by default instead of
requiring extra flags. But that is a weakness in the implementations
here, not a failing of the language.)

>
> Some of us are experienced enough that we don't let data structures be
> written to when they shouldn't be. (In any case many are too complex to
> protect merely by adding 'const'; and if you add too many 'consts', then
> you will be stuck trying to figure out how to modify them when they do
> need to be updated or created).

Some of us are experienced enough to know we are not flawless, and like
to use the help we get. Using "const" does not guarantee bug-free code,
but it is certainly a useful step towards it. Arrogance about not
needing such features is a step backwards.

>
>> The concept of "const" is /so/ important and influential in helping
>> write clear and correct code that some modern programming languages make
>> everything "const" by default - you need to explicitly mark data as
>> "variable" or "mutable".
>
> Yes, some languages even go so far as to get rid of variables altogether
> so that you left scratching your head as to how to achieve the most
> trivial operations.

By that, I take it you mean /you/ personally can't get your head around
functional programming languages? Functional programming languages
require thinking in a somewhat different way, and usually appeal more to
mathematically minded people. They make some kinds of tasks far, far
easier than imperative languages (like C, and like your languages) - but
some tasks are definitely harder.

>
> While other modernish languages (Python for one) take the opposite route
> where /everything/ can be modified [by rebinding names], even functions
> and modules.

They do indeed. There is space for a wide range of ideas in programming
- there is no single perfect language for all purposes.

>
>> So a C programmer should learn to appreciate "const" and use it widely,
>> from their first program. Some use it everywhere they can, even for
>> small local variables, while others think that is a bit verbose.
>> Certainly you should always use it for pointers whenever you can.
>
> C's const is a poor substitute for proper control over read-only data
> structures. It's too fine-grained, it's easy to get it wrong, it's
> possible to still have holes where data structures can be written to.
>

It is not perfect, by any means - but it is still very useful. You do,
of course, have to know how to use it.

> It sucks at emulating named constants.

It does a reasonable job in some cases, and fails in others. As I have
said many times, C++ const is very much better here than C const.

> It gives a false sense of
> security.

If you go out of your way to write bad code, C lets you. But you have
to make a bit of an effort. And it helps if you have proper development
tools and know how to use them. (And let me say again that I'd rather
see more compiler complaints about mixups with const, enabled by default
rather than requiring specific flags.)

> It can be heavily over-used (see some of Stefan Ram's posted
> code).

I'd rather not look at Stefan's code, if it's all the same to you.
People have different styles in their coding - I think Stefan's is
extraordinarily unclear, idiosyncratic and overly complicated. He can
write it the way he wants, of course, but don't take it as an example of
how to use "const".

>
> But most importantly it introduces so much clutter that readability can
> be affected to the extent that genuine bugs may be harder to spot.

Total nonsense. "const" is short and to-the-point, and adds extra
information to the code.

>
> And, it has no place in beginners' code. Many might want to learn C
> /because/ it can be dangerous to use!
>

"const" is an absolute must for beginners. The same goes for a many
features of C that you are determined to hate, fear and avoid (like
"static", "extern", proper declarations, initialisations, small scopes,
pointers, arrays, standard types, multiple files, and probably many
other things).

Malcolm McLean

unread,
May 23, 2018, 7:43:08 AM5/23/18
to
There's such a thing as const poisoning.

const doesn't actually add much to most functions, for example declaring
printf() to take a const char * for the format parameter is unlikely
to reveal any bugs. The snag with const is that once it is introduced
in one place, it "const"s everything it touches. So you have to be
very rigorous in "const correctness", otherwise you get problems
later down the line.
Another answer, which is used in Baby X, is to almost completely exclude
const. One reason for this is that Baby X makes extensive use of
callbacks with context pointers, sometimes called "closures". The context
pointer has to be non-const void *, even though quite often it won't
actually be written to. You simply can't make code const correct in
such circumstances.


Ben Bacarisse

unread,
May 23, 2018, 9:22:26 AM5/23/18
to
Being "not unheard of" is a very low bar, but I never heard it. What,
when you heard it, did you take using pointers to mean? Unless it means
something like declaring your own pointers in your own code it would be
an absurd thing to say. You can't use C at all without using pointers
in the more general sense of the phrase.

--
Ben.

Scott Lurndal

unread,
May 23, 2018, 9:40:07 AM5/23/18
to
Actually, I was programming in C when it was new, and then, and in
the subsequent almost 40 years of C and C++ programming, I've never
heard anyone ever say "you can use C as long as you don't use pointers".

I see a lot of misguided C++ evangelists making such claims, but
never about C.

Malcolm McLean

unread,
May 23, 2018, 9:42:43 AM5/23/18
to
I heard it in the context of a blog-style (this was before the days of
blogs) complaint against unreasonable managers who made unreasonable
demands, one example of which was that C be written without pointers.

I'd guess that they'd have to make an exception for an array being passed
to a subroutine, but if you use [] syntax that's a hidden pointer.

I used Fortran 77 for my PhD work. It doesn't have pointers.

John Bode

unread,
May 23, 2018, 9:54:58 AM5/23/18
to
On Wednesday, May 23, 2018 at 5:09:16 AM UTC-5, Bart wrote:
> On 23/05/2018 07:53, David Brown wrote:
> > On 22/05/18 22:02, bartc wrote:
>
> >> If you take a working program and remove all the consts, then probably
> >> it will continue to work. So they are an embellishment.
>
> > For a language (or library, tool, whatever) to support writing correct
> > programs, there are two primary aims:
> >
> > 1. Make it as easy as possible to write correct code.
>
> > 2. Make it as hard as possible to write incorrect code.
>
> I can't see how it makes it easier. First, you still have to design,
> write, develop, and debug your application.
>

He didn't say that. You snipped this rather important point:

Keith Thompson

unread,
May 23, 2018, 12:18:59 PM5/23/18
to
Malcolm McLean <malcolm.ar...@gmail.com> writes:
[...]
> There's such a thing as const poisoning.
>
> const doesn't actually add much to most functions, for example declaring
> printf() to take a const char * for the format parameter is unlikely
> to reveal any bugs.
[...]

The "const" on printf's format parameter prevents (some) bugs in the
implementation of printf itself, not in calling code. It might not be
likely that you'd accidentally modify the format string, but it's worth
checking. A caller can pass either a "char*" argument or a "const
char*" argument.

If the "const" weren't there, you wouldn't be able to call printf with a
const char* argument.

(This would all be easier if "const" were the default, but it wasn't
practical to do that in a language based on pre-ANSI C, which didn't
have a way to mark things as read-only.)