[Boost-users] Why is there so much co-dependency in Boost? Is there anything to be done about it?

st...@parisgroup.net

unread,

Sep 3, 2012, 1:43:13 AM9/3/12

to boost...@lists.boost.org

Greetings,

As a veteran C++ programmer, I've been an admirer of the Boost library for many years. I've used it at a number of companies I've worked for, especially the SmartPtr library.

Right now I'm working for a company that worries a lot about negative exposure to Open Source software issues such as questions that might arise about authorship, copyright or even patent issues. The company does allow the use of Open Source software, but it requires that each piece of code that is brought in first be justified and vetted. Unfortunately, I'm finding this to be a nearly impossible task when I look at the amount of code that must be compiled to use the Boost modules I'm interested in.

I've done a study and written some tools, to determine just how many Boost header files must be included to use some of the Boost modules. The numbers are staggering:

Any: 79

FileSystem: 276

Smart Ptr: 382

String Algo: 180

I went on and did a tally of which modules these header files came from. Here are the counts for how many other Boost modules each of these modules depends upon:

Any: 8

FileSystem: 13

SmartPtr: 8

String Algo: 15

Given these numbers, I think I have to abandon any thoughts of using Boost within my current company. There's no way I'm going to get approval to bring so much code into our work just to get a SmartPtr or FIleSystem library. This is unfortunate, because due to Boost's existence there doesn't seem to be much work going on out there to offer lighter weight alternatives.

I'm writing this request in the hopes that there's something I'm missing here, and that someone can point out my folly. Is there a way to utilize any of these Boost modules in such a way that they do not require the inclusion of so much code? If not, does anyone have any suggestions as to how to make this fly with my boss? Has this issue come up before and been considered by the Boost designers? I find the issue baffling.

TIA for any help or insight anyone can provide.

Steve

Jürgen Hunold

unread,

Sep 3, 2012, 11:39:42 AM9/3/12

to boost...@lists.boost.org

Hi Steve,

On Sunday, 2. September 2012 22:43:13 st...@parisgroup.net wrote:
> Greetings,
>
> As a veteran C++ programmer, I've been an admirer of the Boost library for
> many years. I've used it at a number of companies I've worked for,
> especially the SmartPtr library.

Well, working without SmartPtr is a no-go.

>
> Right now I'm working for a company that worries a lot about negative
> exposure to Open Source software issues such as questions that might arise
> about authorship, copyright or even patent issues. The company does allow
> the use of Open Source software, but it requires that each piece of code
> that is brought in first be justified and vetted. Unfortunately, I'm
> finding this to be a nearly impossible task when I look at the amount of
> code that must be compiled to use the Boost modules I'm interested in.

Company policies, sigh.

> I've done a study and written some tools, to determine just how many Boost
> header files must be included to use some of the Boost modules.

Please provide more detail on exactly how you measured this lines. Example
programs might be best.

> The numbers are staggering:

> Any: 79
>
> FileSystem: 276
>
> Smart Ptr: 382
>
> String Algo: 180

Well, nearly everything depends on Boost.Config. Config and especially SmartPtr
contain lots of conditionally compiled platform dependent code.

> I went on and did a tally of which modules these header files came from.
> Here are the counts for how many other Boost modules each of these modules
> depends upon:
> Any: 8
>
> FileSystem: 13
>
> SmartPtr: 8
>
> String Algo: 15

These numbers seem too high. A bit more detail is needed to further comment on
this.

> Given these numbers, I think I have to abandon any thoughts of using Boost
> within my current company. There's no way I'm going to get approval to
> bring so much code into our work just to get a SmartPtr or FIleSystem
> library. This is unfortunate, because due to Boost's existence there
> doesn't seem to be much work going on out there to offer lighter weight
> alternatives.

Well, SmartPtr is in c++11. But this is out of reach for most people, I guess.

> I'm writing this request in the hopes that there's something I'm missing
> here, and that someone can point out my folly. Is there a way to utilize
> any of these Boost modules in such a way that they do not require the
> inclusion of so much code?

If you simpley grep for "include" you get lots and lots of files, especially
for low level things like Config or SmartPtr with lots of code for lots of
platforms ;-)
Did you take a look at the "bcp" tools and tried to extract some modules? This
should get you more insight.

> If not, does anyone have any suggestions as to
> how to make this fly with my boss? Has this issue come up before and been
> considered by the Boost designers? I find the issue baffling.

No ideas there, I got Boost in 10 years ago without anyone noticing.

Yours,

Jürgen
--
* Dipl.-Math. Jürgen Hunold !
* voice: ++49 4257 300 ! Fährstraße 1
* fax : ++49 4257 300 ! 31609 Balge/Sebbenhausen
* jhu...@gmx.eu ! Germany
_______________________________________________
Boost-users mailing list
Boost...@lists.boost.org
http://lists.boost.org/mailman/listinfo.cgi/boost-users

Jeremiah Willcock

unread,

Sep 3, 2012, 2:20:57 PM9/3/12

to boost...@lists.boost.org

On Mon, 3 Sep 2012, J�rgen Hunold wrote:

(snip)

>> Given these numbers, I think I have to abandon any thoughts of using Boost
>> within my current company. There's no way I'm going to get approval to
>> bring so much code into our work just to get a SmartPtr or FIleSystem
>> library. This is unfortunate, because due to Boost's existence there
>> doesn't seem to be much work going on out there to offer lighter weight
>> alternatives.
>
> Well, SmartPtr is in c++11. But this is out of reach for most people, I guess.

It is also in TR1, which might be more accessible.

-- Jeremiah Willcock

Jeffrey Lee Hellrung, Jr.

unread,

Sep 3, 2012, 5:58:09 PM9/3/12

to boost...@lists.boost.org

I think at some point in the past it was suggested that Boost could use a tiered system, whereby libraries on tier n could only depend on libraries in tiers 0, ..., n. The tier of a library would give some indication as to how fundamentally and generally useful that library is. I don't know how practically informative such a stratification would ultimately be, though.

- Jeff

Sebastian Redl

unread,

Sep 4, 2012, 3:15:26 AM9/4/12

to boost...@lists.boost.org

On 03.09.2012 07:43, st...@parisgroup.net wrote:

Greetings,

Right now I'm working for a company that worries a lot about negative exposure to Open Source software issues such as questions that might arise about authorship, copyright or even patent issues. The company does allow the use of Open Source software, but it requires that each piece of code that is brought in first be justified and vetted.

Leaving the issue of Boost's structure aside, how does this vetting process even work? I can maybe see someone looking over the code to look for well-known patented software patterns (Does anyone know such? I can't think of any outside of lockfree programming.), but how on earth would you verify authorship or copyright, beyond what the file says? Do you scan the commit history of the files? Also, what's the point? The code is released by everyone who contributed to it under the liberal Boost license. You won't find any comments or commit messages to the point of "oh, by the way, this isn't under the BSL". Or maybe someone who looks it over is expected to find similarities to other software, which isn't under the BSL? This is, in my opinion, completely impossible - there's just way too much code out there to compare to to make any significant difference. How would that look in court? "Yeah, these people claim that Boost contains some of their GPL code, but when we decided to use Boost, we compared its code to 0.0001% of the GPL code in existence (a generous estimate) and didn't find similarities, so we shouldn't be liable!"
Maybe you can find a way to convince your boss that the policy just doesn't make sense.

Sebastian

Larry

unread,

Sep 4, 2012, 6:36:52 AM9/4/12

to boost...@lists.boost.org

I have not been through the Boost source but I have gone through other open source code in this kind of process. It is amazing how much code exists that still has comments like “Property of ...” or “Copyright ....” regardless of the general say GPL license for the project source. The person providing the code means well but may not understand the legal ramifications. Algorithms may be patented (such as some crypto ones) so unless the open source project provides a method of ensuring such code is not included when it is built you might have to get a license to use the project. The copyright registration in the US at least requires the names of all contributors and their involvement. If you contribute it needs to be clear that you have the right to contribute as some companies may assert rights through employment agreements even if it is not directly in your job scope. If a company uses open source code without due diligence to the origin of code the results could be patent infringement litigation, substantial licensing fees, a required change to a company’s product to remove functionality related to the open source code, and/or other similarly unpalatable options.

Larry

Nat Linden

unread,

Sep 4, 2012, 9:23:00 AM9/4/12

to boost...@lists.boost.org

On Mon, Sep 3, 2012 at 1:43 AM, st...@parisgroup.net
<st...@parisgroup.net> wrote:

> Right now I'm working for a company that worries a lot about negative
> exposure to Open Source software issues such as questions that might arise
> about authorship, copyright or even patent issues. The company does allow
> the use of Open Source software, but it requires that each piece of code
> that is brought in first be justified and vetted.

I, too, used to work for a company that fretted about such things. In
my case the problem became moot when I left to work at a more
reasonable company.

> I'm writing this request in the hopes that there's something I'm missing
> here, and that someone can point out my folly. Is there a way to utilize
> any of these Boost modules in such a way that they do not require the
> inclusion of so much code? If not, does anyone have any suggestions as to
> how to make this fly with my boss? Has this issue come up before and been
> considered by the Boost designers? I find the issue baffling.

I'd like to defend Boost library authors (no central group of
designers, btw). The process is just as you've seen in this mailing
list. An author builds a library and proposes it for inclusion in
Boost. Once a review manager steps forward, the library is offered for
review. Often the library is modified to satisfy review feedback; but
in due course it is accepted into Boost by consent of the community.

Those of us who already use and appreciate Boost are the ones most
likely to submit a library for consideration. Why would any one of us
refrain from using valuable Boost features in building that new
library? We are programmers trying to solve a technical problem, not
lawyers inventing doomsday scenarios.

Just as with the recent evolution of the C++ language itself, many of
the most powerful Boost libraries are specifically designed to make it
easier to build other libraries. When Boost offers support that cannot
readily be licensed elsewhere -- support that would be prohibitively
expensive to rebuild from scratch! -- a good engineer will leverage
it. Good engineering involves reusing available technology rather than
reinventing it. Forbidding engineers from using available technology
is the opposite of good engineering.

It is far from surprising to me that Boost libraries heavily use other
Boost libraries. That seems entirely appropriate.

Paul A. Bristow

unread,

Sep 4, 2012, 12:16:37 PM9/4/12

to boost...@lists.boost.org

From: boost-use...@lists.boost.org [mailto:boost-use...@lists.boost.org] On Behalf Of
Jeffrey Lee Hellrung, Jr.
Sent: Monday, September 03, 2012 10:58 PM

To: boost...@lists.boost.org
Subject: Re: [Boost-users] Why is there so much co-dependency in Boost? Is there anything to be done
about it?

> On Sun, Sep 2, 2012 at 10:43 PM, st...@parisgroup.net <st...@parisgroup.net> wrote:
Greetings,
> As a veteran C++ programmer, I've been an admirer of the Boost library for many years. I've >
used it at a number of companies I've worked for, especially the SmartPtr library.
> Right now I'm working for a company that worries a lot about negative exposure to Open
> Source software issues such as questions that might arise about authorship, copyright or > > even
patent issues. The company does allow the use of Open Source software, but it requires >that each
piece of code that is brought in first be justified and vetted. Unfortunately, I'm >finding this to
be a nearly impossible task when I look at the amount of code that must be >compiled to use the
Boost modules I'm interested in.

I don't think that you can avoid using all-of-Boost or none-of-Boost. There are very good reasons
for the co-dependency. Concentrate your arguments on the massive benefits you are getting?

Boost has put a lot of effort into making sure that all files (including documentation and graphics)
contain an authors' copyright claim notice (and that it is a correct claim - and have rejected items
for which this was not possible).
(There is a 'sinners list' at http://boost.cowic.de/rc/inspect-trunk.html - so you can check the
few libraries and files that do not conform. Don't panic at the number of items missing - few are
actual C++ code - most are in documentation and examples).

Although it is unlikely that any code contains patentable concepts, it is not possible to be sure
until this is tested in the courts. Boost would not knowingly provide software that is patented
without marking it as such clearly.

Because Boost is a leader in software development, that a library is accepted for Boost makes it
very much less likely that it is already patented, and the code being placed in the public domain
may provide evidence of 'prior art' to thwart future attempts to obtain a patent.

Finally, there are very many companies using Boost and none, to my knowledge, have reported getting
into trouble so far.

You will need to work to get a blanket acceptance of the all Boost libraries?

HTH

Paul

---
Paul A. Bristow,
Prizet Farmhouse, Kendal LA8 8AB UK
+44 1539 561830 07714330204
pbri...@hetp.u-net.com

Robert Ramey

unread,

Sep 4, 2012, 2:29:56 PM9/4/12

to boost...@lists.boost.org

st...@parisgroup.net wrote:
> Greetings,
> As a veteran C++ programmer, I've been an admirer of the Boost
> library for many years. I've used it at a number of companies I've
> worked for, especially the SmartPtr library.
>

> I've done a study and written some tools, to determine just how many
> Boost header files must be included to use some of the Boost modules.
> The numbers are staggering:

...

This is a legitimate concern which IMO has not been taken seriously by the
Boost community.
To some extent it's unavoidable since it's more efficcient to have one
"best" solution
imported everywhere rather than replecating code.

But it doesn't have to be as bad as it currently is. And it's only getting
worse as more libraries
get added to Boost.

Here's what I suggest for boost users such as yourself:

a) don't use "convenience headers" which suck in all the headers in a
library
rather than just the one's used. It seems tedious because one has to read
more carefully to know which headers to use but it saves huge amounts of
build time and diminishes "dependency surprises" which can cost a lot of
time to track down.

b) a few libraries "infect" your code with "spreading" depencies. Use your
analysis to detect these libraries and complain about them. I have done
this in the past to no effect. It did make me feel I was doing something
though. If you can't get library authors to see this, you'll just have to
avoid
that library or fork your own copy.

c) In your own code, use pre-compiled headers. This speeds up re-builds.
It DOES force you to spend some time looking at how you've
divided up your own code but this time is a good investment.

Here's what I suggest library author's do:

a) Take this gentlement's complaint seriously.
b) Consider eliminating "convenience" headers.
c) When writing documentation avoid depence on "convenience headers".
This seems like it adds some work - less conveniece - but hels addressing
this man's problem.

Here's what Boost can do. Something like:

a) Formalize the comcept of boost library "levels"
i) core - e.g. config, auto-link, BOOST_NONCOPIABLE, ...
ii) utility - e.g. scoped_ptr, ...
iii) application support - e.g serialization

Every boost component would depend only on components at the same or lower
level.
The assignment of library level would be part of the review process.

The motivation here is to support the future growth of Boost and C++
libraries in general.
I see this as the fundamental requirement behind any efforts to achive
"Boost Modularization"

Robert Ramey

Chris Cleeland

unread,

Sep 4, 2012, 1:36:10 PM9/4/12

to boost...@lists.boost.org

On Mon, Sep 3, 2012 at 12:43 AM, st...@parisgroup.net <st...@parisgroup.net> wrote:

Right now I'm working for a company that worries a lot about negative exposure to Open Source software issues such as questions that might arise about authorship, copyright or even patent issues. The company does allow the use of Open Source software, but it requires that each piece of code that is brought in first be justified and vetted. Unfortunately, I'm finding this to be a nearly impossible task when I look at the amount of code that must be compiled to use the Boost modules I'm interested in.

The Boost license is pretty liberal as well as being pretty explicit. I've been at a number of companies where they've had their legal teams vet the license terms and, as long as license terms pass muster, the code is welcome from a legal perspective.

--
Chris Cleeland

Oliver Kowalke

unread,

Sep 4, 2012, 1:38:41 PM9/4/12

to boost...@lists.boost.org

Am 04.09.2012 20:29, schrieb Robert Ramey:
> a) don't use "convenience headers" which suck in all the headers in a
> library rather than just the one's used.

I've read an advice on the mailing list that boost libraries should
provide a 'all.hpp' (containing all headers of the library). Is it not
recommended any more?

Lars Viklund

unread,

Sep 4, 2012, 6:10:03 PM9/4/12

to boost...@lists.boost.org

On Tue, Sep 04, 2012 at 07:38:41PM +0200, Oliver Kowalke wrote:
> Am 04.09.2012 20:29, schrieb Robert Ramey:
>> a) don't use "convenience headers" which suck in all the headers in a
>> library rather than just the one's used.
>
> I've read an advice on the mailing list that boost libraries should
> provide a 'all.hpp' (containing all headers of the library). Is it not
> recommended any more?

I'd say they serve two different types of crowds. There's the people
that care about dependencies and build times, and there's the people
that want to pull in everything for proof-of-concept or exploratory
coding without looking up everything constantly in documentation.

It would be a major disservice to the latter kind if there was no
boost/foo/kitchen_sink.hpp for library foo.

There's an additional reason as to why some people include all-headers,
and that's documentation that doesn't clearly mention in which headers
different functionality exists.

There's some Boost libraries that are completely implicit about which
headers contains what parts, which makes it some major guesswork to get
the bits and pieces you want included, especially if you do not have
#include-completion in your editor.

--
Lars Viklund | z...@acc.umu.se

Robert Ramey

unread,

Sep 4, 2012, 10:04:26 PM9/4/12

to boost...@lists.boost.org

Lars Viklund wrote:
> On Tue, Sep 04, 2012 at 07:38:41PM +0200, Oliver Kowalke wrote:
>> Am 04.09.2012 20:29, schrieb Robert Ramey:
>>> a) don't use "convenience headers" which suck in all the headers in
>>> a library rather than just the one's used.
>>
>> I've read an advice on the mailing list that boost libraries should
>> provide a 'all.hpp' (containing all headers of the library). Is it
>> not recommended any more?
>
> I'd say they serve two different types of crowds. There's the people
> that care about dependencies and build times, and there's the people
> that want to pull in everything for proof-of-concept or exploratory
> coding without looking up everything constantly in documentation.
>
> It would be a major disservice to the latter kind if there was no
> boost/foo/kitchen_sink.hpp for library foo.

OK - we can compromise on this point. Authors can provide
convenience headers for this type of user ...

>
> There's an additional reason as to why some people include
> all-headers, and that's documentation that doesn't clearly mention in
> which headers different functionality exists.

while at the same time (re)organizing the documentation and code to
support the inclusion of only necessary code.

> There's some Boost libraries that are completely implicit about which
> headers contains what parts, which makes it some major guesswork to
> get the bits and pieces you want included, especially if you do not
> have #include-completion in your editor.

Which is a source of the problems that the original poster is noting.

Let's update code/dcoument/review guidlines to include these
considerations.

Robert Ramey

Vicente J. Botet Escriba

unread,

Sep 5, 2012, 1:13:05 AM9/5/12

to boost...@lists.boost.org

Le 05/09/12 00:10, Lars Viklund a écrit :

> On Tue, Sep 04, 2012 at 07:38:41PM +0200, Oliver Kowalke wrote:
>> Am 04.09.2012 20:29, schrieb Robert Ramey:
>>> a) don't use "convenience headers" which suck in all the headers in a
>>> library rather than just the one's used.
>> I've read an advice on the mailing list that boost libraries should
>> provide a 'all.hpp' (containing all headers of the library). Is it not
>> recommended any more?
> I'd say they serve two different types of crowds. There's the people
> that care about dependencies and build times, and there's the people
> that want to pull in everything for proof-of-concept or exploratory
> coding without looking up everything constantly in documentation.
>
> It would be a major disservice to the latter kind if there was no
> boost/foo/kitchen_sink.hpp for library foo.
>
> There's an additional reason as to why some people include all-headers,
> and that's documentation that doesn't clearly mention in which headers
> different functionality exists.

This is a documentation bug and the user should just create a ticket so
that this is fixed.

>
> There's some Boost libraries that are completely implicit about which
> headers contains what parts, which makes it some major guesswork to get
> the bits and pieces you want included, especially if you do not have
> #include-completion in your editor.
>

As for example ...

Best,
Vicente

Ingolf Steinbach

unread,

Sep 5, 2012, 3:03:29 AM9/5/12

to boost...@lists.boost.org

Hi.

2012/9/4 Oliver Kowalke <oliver....@gmx.de>:

> I've read an advice on the mailing list that boost libraries should provide
> a 'all.hpp' (containing all headers of the library). Is it not recommended
> any more?

I think that the mere existence of all.hpp files is not the problem
here. What should be avoided is to use these all.hpp files within
boost itself as that would unnecessarily enlarge the dependency tree.
But then, I do not know whether such include directives currently
exist within boost.

Ingolf

Lars Viklund

unread,

Sep 5, 2012, 3:23:59 AM9/5/12

to boost...@lists.boost.org

On Wed, Sep 05, 2012 at 07:13:05AM +0200, Vicente J. Botet Escriba wrote:
> Le 05/09/12 00:10, Lars Viklund a écrit :

>> There's some Boost libraries that are completely implicit about which
>> headers contains what parts, which makes it some major guesswork to get
>> the bits and pieces you want included, especially if you do not have
>> #include-completion in your editor.
>>
> As for example ...

My statement was what I subjectively felt during hacking sprees when I
know what I want, but can't readily find where it's located when reading
about it in the main body of the documentation.

Going through the libraries, it seems to be mostly a matter of lack of
such detail in the prose non-reference sections, particularly around the
sections that introduce a new concept and for the inline samples, like:
* Bimap tutorial;
* Chrono user's guide;
* Concept;
* Context only has "include all.hpp or the individual headers", with no
reference listing of headers and types at all;
* Filesystem v3 only documents top-level filesystem.hpp and
filesystem/fstream.hpp;
* Iterator - the only mention of headers at all is in the top-level
bullet points for a few of the types;
* Lambda has a brief listing in Installing, but nothing in-line;
* Locale doesn't mention headers by name, but as it's Doxygenish, it
links most names to the header page where they live.
* MSM doesn't even mention any all-header, it just seems to assume you
know exactly what headers to use, and the reference section is flat;
* MPL tutorial;
* NumericConversion's only mention of any headers is in examples for
the sections;
* Parameter has a decent set of cross-referenced docs, once you find it
under section 7 - until that point no mention of the headers exists;
* PointerContainer lacks header mentions in both the tutorial and the
reference, with the latter surprising as the reference doesn't contain
any header information at all, instead having to go to the non-common
'Library headers' section;
* Polygon seems to be completely lacking any header information
whatsoever;
* ProgramOptions - no mention in the tutorial, overview or howto;
* PropertyTree - reasonably straightforward based on the names of the
headers in the reference; prose rarely links typenames to the right
headers;
* Range separates the reference and the header structure; confusing if
you've finally gotten used to information being in the reference
section;
* Test only mentions headers occasionally in samples, I've yet to find
anything standalone talking about includes at all, except for the
unrecommended standalone header;
* Thread lacks an overview of the headers, mentioning them occasionally
in the reference - I just give up and use the all-header;
* Wave has no top-level index, but the few sections mention all headers
by name and synopsis and links to (disabled in Trac) trunk versions of
the headers;

The libraries that use qbk-style documentation tends to be pretty
acceptable, as the common style means that you get a list of headers on
the landing page for the library. While not optimal while reading, it's
at least something.

This might not be a major problem if you're willing to manually
cross-reference the reference sections when reading the
tutorial/guide/introduction sections in which is where the meat of the
documentation usually is located, but it feels quite the wrong way
around to dive into header listings to find the type/concept you want,
when the main body of documentation around it is in the prose.

On an unrelated note, the next time I have to use the expandos in the
Iostreams or Serialization documentation, I'm going to cry. They're such
fiddly and tiny targets, so unfriendly that I think more than once about
using the library due to the inaccessibility of the documentation.

Now that I've insulted practically every single Boost author, let me say
"good job, everyone". The level of documentation in Boost is way better
than what you usually see out in the wild, it's just that it's quite
easy to feel disorientated.

--
Lars Viklund | z...@acc.umu.se

Neil Groves

unread,

Sep 5, 2012, 5:34:25 AM9/5/12

to boost...@lists.boost.org

On Sep 3, 2012 2:58 PM, "st...@parisgroup.net" <st...@parisgroup.net> wrote:
>
> Greetings,
>

>
>

> Right now I'm working for a company that worries a lot about negative exposure to Open Source software issues such as questions that might arise about authorship, copyright or even patent issues. The company does allow the use of Open Source software, but it requires that each piece of code that is brought in first be justified and vetted. Unfortunately, I'm finding this to be a nearly impossible task when I look at the amount of code that must be compiled to use the Boost modules I'm interested in.
>
>

Why would open source be more likely to infringe a patent than your own solution? It is less likely in fact since the public availability of code can be used to demonstrate prior art.

Incorporating third party software into your own QA process is often very sensible.

>
> I've done a study and written some tools, to determine just how many Boost header files must be included to use some of the Boost modules. The numbers are staggering:
>
>
>
> Any: 79
>
> FileSystem: 276
>
> Smart Ptr: 382
>
> String Algo: 180
>
>
>

I am not staggered. Additionally the number of files has to be one of the most useless software metrics in the known multiverse!

> I went on and did a tally of which modules these header files came from. Here are the counts for how many other Boost modules each of these modules depends upon:
>
>
>
> Any: 8
>
> FileSystem: 13
>
> SmartPtr: 8
>
> String Algo: 15
>
>
>
> Given these numbers, I think I have to abandon any thoughts of using Boost within my current company. There's no way I'm going to get approval to bring so much code into our work just to get a SmartPtr or FIleSystem library. This is unfortunate, because due to Boost's existence there doesn't seem to be much work going on out there to offer lighter weight alternatives.
>
>
>
> I'm writing this request in the hopes that there's something I'm missing here, and that someone can point out my folly. Is there a way to utilize any of these Boost modules in such a way that they do not require the inclusion of so much code? If not, does anyone have any suggestions as to how to make this fly with my boss? Has this issue come up before and been considered by the Boost designers? I find the issue baffling.
>

The issue simply isnt a development priority. We prefer to emphasise good interfaces to correct code. The internal header file coupling has a potential to increase your build time. Why else do you care (aside from justifying an arbitrary procedure)?

If you have compile time issues please bring these up concretely and with measurements.

>
>
> TIA for any help or insight anyone can provide.
>
>

My recommendation would be to look for another job!

I think all the Boost devs would like to improve modularisation, and efforts are already underway.

Frankly increasing the number of supported deployable configurations is unlikely to improve external quality factors. It is likely to be better to take the better tested ensemble of libraries.

Test and use what you like and leave what you dont.

I may be on my own here, but my suspicion would be that dogmatic enforcement of inappropriate rules is the primary problem.

If you can't trust your developers to include the approved libraries you probably cant trust then to make any change without supervision.

>
> Steve
>

Neil Groves

Robert Ramey

unread,

Sep 5, 2012, 1:23:34 PM9/5/12

to boost...@lists.boost.org

Lars Viklund wrote:
> On an unrelated note, the next time I have to use the expandos in the
> Iostreams or Serialization documentation, I'm going to cry. They're
> such fiddly and tiny targets, so unfriendly that I think more than
> once about using the library due to the inaccessibility of the
> documentation.

Hmmm - would just making these bigger address your concern here?
That would be easy to fix - just add a track item.
Or is it the whole concept of using the navigator. Note that the
serialization library has the index at the top of each page just
as all the other libraries do. Maybe just supressing the
navigator panel would be more to your taste. In Firefox at
least. one can right click to open a frame in a new window
at which point - as far as I can tell - it would be the same
as every other boost library.

I included the "expando" after the fact since I found it excedingly
tedious to navigate complex documentation through the pages
themselves

Robert Ramey.

Lars Viklund

unread,

Sep 5, 2012, 4:49:07 PM9/5/12

to boost...@lists.boost.org

On Wed, Sep 05, 2012 at 09:23:34AM -0800, Robert Ramey wrote:
> Lars Viklund wrote:
> > On an unrelated note, the next time I have to use the expandos in the
> > Iostreams or Serialization documentation, I'm going to cry. They're
> > such fiddly and tiny targets, so unfriendly that I think more than
> > once about using the library due to the inaccessibility of the
> > documentation.
>
> Hmmm - would just making these bigger address your concern here?
> That would be easy to fix - just add a track item.

I'm afraid that larger targets might space out the items too much.

> Or is it the whole concept of using the navigator. Note that the
> serialization library has the index at the top of each page just
> as all the other libraries do. Maybe just supressing the
> navigator panel would be more to your taste. In Firefox at
> least. one can right click to open a frame in a new window
> at which point - as far as I can tell - it would be the same
> as every other boost library.

A few things that might make it slightly more accessible:

* Have an "expand-all" button to expand everything, which would make the
sidebar much more similiar to the typical Boost library landing page.
For Serialization, expanding everything is at most two screenheights
or so, which is quite manageable and scrollable.

* If the sidebar becomes hideable, the documentation pages themselves
would need some prev/up/next navigation doohickeys.

* If the sidebar becomes hideable, there ought to be a top-level page
that contains the ToC.

Of these things, the "expand-all" and bigger twiddle targets would
likely be among the easier, while the other two would require a lot more
work.

> I included the "expando" after the fact since I found it excedingly
> tedious to navigate complex documentation through the pages
> themselves

It's an interesting idea and helps a lot with having to go
back-and-forth between the top-level page and any subpages of interest.
The downside is that it's hard to open a part you're interested in a new
tab/window and retain the sidebar frame.

One of the bothersome things about S11n/Iostreams documentation is that
it's impossible to link someone a subpage on IRC or on mail while
retaining the sidebar.

--
Lars Viklund | z...@acc.umu.se

Jens Müller

unread,

Sep 5, 2012, 5:29:49 PM9/5/12

to boost...@lists.boost.org

Am 04.09.2012 18:16, schrieb Paul A. Bristow:
> Because Boost is a leader in software development, that a library is accepted for Boost makes it
> very much less likely that it is already patented, and the code being placed in the public domain
> may provide evidence of 'prior art' to thwart future attempts to obtain a patent.

Do you have formal escrow procedures for that?

To proof mere existence in a legally sustainable way, it would be
sufficient to apply cryptographically signed timestamps.

German law provides for "qualified timestamps" (§ 9 Signaturgesetz). I
think they are based on European directives. What I don't know is
whether there are similar concepts in other important jurisdictions such
as the US.

-- Jens

Marshall Clow

unread,

Sep 5, 2012, 5:38:55 PM9/5/12

to boost...@lists.boost.org

On Sep 5, 2012, at 2:29 PM, Jens Müller <bl...@tessarakt.de> wrote:

> Am 04.09.2012 18:16, schrieb Paul A. Bristow:
>> Because Boost is a leader in software development, that a library is accepted for Boost makes it
>> very much less likely that it is already patented, and the code being placed in the public domain
>> may provide evidence of 'prior art' to thwart future attempts to obtain a patent.
>
> Do you have formal escrow procedures for that?
>
> To proof mere existence in a legally sustainable way, it would be sufficient to apply cryptographically signed timestamps.
>
> German law provides for "qualified timestamps" (§ 9 Signaturgesetz). I think they are based on European directives. What I don't know is whether there are similar concepts in other important jurisdictions such as the US.

No, Boost does not.
We've got svn history (and before that, cvs), and a whole slew of mailing list archives.

-- Marshall

Marshall Clow Idio Software <mailto:mclow...@gmail.com>

A.D. 1517: Martin Luther nails his 95 Theses to the church door and is promptly moderated down to (-1, Flamebait).
-- Yu Suzuki

Neil Groves

unread,

Sep 5, 2012, 6:16:14 PM9/5/12

to boost...@lists.boost.org

> I've done a study and written some tools, to determine just how many
> Boost header files must be included to use some of the Boost modules.
> The numbers are staggering:

...

This is a legitimate concern which IMO has not been taken seriously by the
Boost community.

Why do you think this is not being taken seriously? I'm under the impression that there is disagreement as to the real priority of this concern. I think that is quite different.

I know I put forward several points that may well be perceived as not taking this seriously. I was genuinely asking why the OP cared about the number of included header files. As a technical question I still find it puzzling to find this a development priority since the solutions involve considerable development effort and a combinatorial explosion in deployed configurations. For these disadvantages, if I am to spend time reducing header file dependencies I require a good reason for expending this effort. Most arguments about this have centered around satisfying an existing arbitrary rule in other organisations. While I sympathise with the developers having to deal with these organisations, it is not compelling to prioritise this issue over and above the work I would be doing to extend Boost.Range or deal with trac tickets.

To some extent it's unavoidable since it's more efficcient to have one
"best" solution
imported everywhere rather than replecating code.

But it doesn't have to be as bad as it currently is. And it's only getting
worse as more libraries
get added to Boost.

Again why is it 'Bad'? This is re-use in most examples I have seen. Coupling isn't inherently bad. Re-use is coupling.

Specific examples of poor coupling most probably exist. I suspect these would receive a good response from the Boost development community if these were specifically raised.

Here's what I suggest for boost users such as yourself:

a) don't use "convenience headers" which suck in all the headers in a
library
rather than just the one's used. It seems tedious because one has to read
more carefully to know which headers to use but it saves huge amounts of
build time and diminishes "dependency surprises" which can cost a lot of
time to track down.

+1 - Definitely good advice. We Boost developers should provide the option for the users to make their own choice with respect to having a simple easy to remember include path with slower compile-times, or more specific includes. I believe we do that well.

b) a few libraries "infect" your code with "spreading" depencies. Use your
analysis to detect these libraries and complain about them. I have done
this in the past to no effect. It did make me feel I was doing something
though. If you can't get library authors to see this, you'll just have to
avoid
that library or fork your own copy.

Did you complain about them in trac tickets that have not been resolved? I definitely am guilty of recently having poor response times to trac tickets. This is something which I would like to assist in getting resolved. Of course, if the library maintainer disagrees that the header file inclusion or dependency is 'bad' then the usual rules apply that the maintainer has the final say. The benefit of avoiding design by committee comes at the cost of some disagreement.

c) In your own code, use pre-compiled headers. This speeds up re-builds.
It DOES force you to spend some time looking at how you've
divided up your own code but this time is a good investment.

I'm personally undecided about pre-compiled headers. I find this normally means that one has considerably increased the amount of included code per translation unit and therefore generated additional coupling that is far more significant than the perhaps sub-optimal inclusions within Boost.

Here's what I suggest library author's do:

a) Take this gentlement's complaint seriously.

I would be surprised if any of the Boost developers didn't take this seriously. My perception is that many of us are not convinced this is a priority. That isn't the same thing.

b) Consider eliminating "convenience" headers.

I think the Boost library authors are good at choosing the right level of headers to include. Do you have some specific counter-examples?

c) When writing documentation avoid depence on "convenience headers".
This seems like it adds some work - less conveniece - but hels addressing
this man's problem.

I definitely took the approach of using convenience headers in examples. I did this because the header files were not the primary focus of the example code. The convenience header in my examples provides code that works while reducing clutter in my example. I therefore disagree that we should have a policy of avoiding the convenience headers. Perhaps instead we could provide additional guidance as to how to use the layers of header files. However this seems trivially obvious.

Here's what Boost can do. Something like:

a) Formalize the comcept of boost library "levels"
i) core - e.g. config, auto-link, BOOST_NONCOPIABLE, ...
ii) utility - e.g. scoped_ptr, ...
iii) application support - e.g serialization

Layering the libraries is a good idea. It would appear to be the only manageable way of providing more deployment componentisation in sensible number of tested cohesive units.

Every boost component would depend only on components at the same or lower
level.
The assignment of library level would be part of the review process.

The motivation here is to support the future growth of Boost and C++
libraries in general.
I see this as the fundamental requirement behind any efforts to achive
"Boost Modularization"

Robert Ramey

Neil Groves

Robert Ramey

unread,

Sep 5, 2012, 10:13:07 PM9/5/12

to boost...@lists.boost.org

Neil Groves wrote:

> This is a legitimate concern which IMO has not been taken seriously
> by the
> Boost community.
>
> Why do you think this is not being taken seriously? I'm under the
> impression that there is disagreement as to the real priority of this
> concern. I think that is quite different.

I was specifically refering to the recent discussion of Boost Exception.
I got "out voted" on that one which I characterized as having my
"concerns not being taken seriously". If you prefer to say
"disagreement as to real priority" I'm fine with that. I don't think
we're disagreeing here.

> To some extent it's unavoidable since it's more efficcient to have one
> "best" solution imported everywhere rather than replecating code.
>
> But it doesn't have to be as bad as it currently is. And it's only
> getting worse as more libraries get added to Boost.

> Again why is it 'Bad'? This is re-use in most examples I have seen.
> Coupling isn't inherently bad. Re-use is coupling.

OK - maybe 'bad' is the wrong word. As new libraries get added
they depend on previous libraries. It's generally a good thing that code
not be repeated. But it makes it more likely that a library user is
importing stuff that might surprise him. This is unfortunate (bad) but
unavoidable. It's "bad" when the library author includes surprising
and or unnecessary dependencies. These bite the user with unexpected
behavior like
a) much bigger executables than expected
b) new requirements like multi-threading
c) breakage from updates of seemingly unrelated components.

> Specific examples of poor coupling most probably exist. I suspect
> these would receive a good response from the Boost development
> community if these were specifically raised.

In many cases there is disagreement what "poor coupling" is.

> Did you complain about them in trac tickets that have not been
> resolved?

This is not an issue regarding bugs. There are differences of
opinion regarding library design and approriate functionality
and requirements. This differences are reconciled as best
we can through the review process.

> b) Consider eliminating "convenience" headers.

> I think the Boost library authors are good at choosing the right
> level of headers to include. Do you have some specific
> counter-examples?

I don't think that "convenience headers" all that bad. As a user, I'm free
not to use them (and I don't). BUT -

a) I think they encourage practices among users that I don't think are good
ones.
b) I don't think they are really necessary. The serialization library
- a big library - has never had them and no one has ever asked for them.
c) It's trivial for users to make their own if they want them.

> I definitely took the approach of using convenience headers in
> examples. I did this because the header files were not the primary
> focus of the example code. The convenience header in my examples
> provides code that works while reducing clutter in my example.

d) I understand this. But I see it differently. If one has to use
convenience headers to make the examples simple enough to
understand, I would consider the possibility that the library
might be made clearer and easier to use by refactoring the
headers..

>Perhaps instead we could provide additional
> guidance as to how to use the layers of header files.

I think that is one of the things I'm proposing here.

> However this seems trivially obvious.

lol - then I guess it will be easy to agree on how to it

Robert Ramey

unread,

Sep 5, 2012, 10:34:40 PM9/5/12

to boost...@lists.boost.org

Lars Viklund wrote:
> On Wed, Sep 05, 2012 at 09:23:34AM -0800, Robert Ramey wrote:
> A few things that might make it slightly more accessible:
>
> * Have an "expand-all" button to expand everything, which would make
> the sidebar much more similiar to the typical Boost library landing
> page. For Serialization, expanding everything is at most two
> screenheights or so, which is quite manageable and scrollable

> * If the sidebar becomes hideable, the documentation pages themselves
> would need some prev/up/next navigation doohickeys.
>
> * If the sidebar becomes hideable, there ought to be a top-level page
> that contains the ToC.
>
> Of these things, the "expand-all" and bigger twiddle targets would
> likely be among the easier, while the other two would require a lot
> more work.

These are in fact not too hard. I'll consider it.

It turns out that I've been sucked in to spending time learning about
XSLT, XML editors, DocBooK, BoostBook. etc. So I've
considered re-formating the serialization documentation in terms
of BoostBook which would address all he the above and give
the serialization library documentation a more up date look
compatible with the other boost libraries.

BUT - I would lose the navigator - which I'm actually in love with.

What I would really like is an XSLT script which would build
a free-floating BoostBook Navigator which would navigate
all the boost book docs. Of course it's usage would be optional.

>> I included the "expando" after the fact since I found it excedingly
>> tedious to navigate complex documentation through the pages
>> themselves
>
> It's an interesting idea and helps a lot with having to go
> back-and-forth between the top-level page and any subpages of
> interest. The downside is that it's hard to open a part you're
> interested in a new tab/window and retain the sidebar frame.

Agreed. Actually it probably would be easy to implement a
right click functionality to open the selected page in another
tab or window.

So to summarize

a) a Navigator in a separate window which looks like
the table of contents panel in the serialzation libary
b) with expand/collapse functionality
c) which opens pages in a main window, new window or tab
d) is built using an XSLT script from BoostBook XML.
e) could be used to navigate all boost book documentation

> One of the bothersome things about S11n/Iostreams documentation is
> that it's impossible to link someone a subpage on IRC or on mail while
> retaining the sidebar.

Totally agree. My above idea would fix this since the navigator would
be in a separate window and not clobber the page's URL.

Robert Ramey

st...@parisgroup.net

unread,

Sep 5, 2012, 11:05:21 PM9/5/12

to boost...@lists.boost.org

Hi again,

I'm the original poster that started this thread. WOW! Thanks for all of the great responses. I apologize for posting this message and then getting called away on a business trip. It is only just now that I'm getting back to see what kind of response I got, and I'm thrilled. I'm happy to see that a number of folks involved with Boost see this issue as a significant problem, if only to certain types of companies.

APOLOGY: I must apologize for a small mistake in my numbers, that might be somewhat important to someone. I managed to reverse the counts for the "Smart Ptr" and "String Algo" libraries. I remember thinking it kinda strange that one referenced more modules but the other referenced more lines. So it's really true that "String Algo" causes 382 files to be read, while "Smart Ptr" causes only 180 to be read. Sorry about that.

I've taken a first pass through all the responses, and rather than respond to each of them individually, I'll offer some more information here and attempt to address address some of the questions that have been pointed back to me.

1) How did I get these numbers. Give some examples.

Here's one of the places I'd love to be shown to be wrong. If my numbers are inflated, my sales job to my boss will be that much easier. So by all means, someone correct me if my approach is unsound.

What I did was very simple. All I did was compile a very simple program and have g++ give me a list of all of the headers it read during the compilation, excluding system headers. This is done using the following line from my test Makefiles:

$(CXX) $(CPPFLAGS) $(CXXFLAGS) -c $< 2> /dev/null -MM > headers.lst

Here's the test for SmartPtr:

#include <iostream>
#include <boost/smart_ptr.hpp>
using namespace std;

int main(int argc, char* argv[])
{
return 0;
}

This simple test produces a file named headers.lst with 180 unique header paths in it, all starting with "boost/".

Discovering the modules used by each module took a few hours of fairly tedious labor, where I sorted and then grouped each list of headers, where each group consisted of headers coming from the same module.

2) Here are the specific module dependencies:

Any: "base", Config, Exception, MPL, Preprocessor, Static Assert, Type Traits, Utility

Filesystem: "base", Config, Exception, Functional, Integer, Iterators, MPL, Preprocessor, Smart Ptr, Static Assert, Type Traits, Utility

SmartPtr "base", Config, Exception, MPL, Preprocessor, Static Assert, Type Traits, Utility

StringAlgo: "base", Bind, Compatibility, Concept Check, Config, Exception, Function, Integer, Iterators, MPL, Preprocessor, Range, Static Assert, String Algo, Type Traits, Utility

3) In response to the suggestion to not use the convenience headers, like say "smart_ptr.hpp" as apposed to a header for an individual header type.

It's bad enough to tell my programmers they can only use certain Boost modules. To tell them that they can only use certain parts of certain Boost modules just gets to be too much. Plus, I can see eventually using most, if not all of the functionality of the SmartPtr module. The same can be said for the other modules I'm interested in. If I have to run to my boss every time I want to use one new particular feature from a Boost module, it's not worth the effort. Nor would it be worth the overhead of figuring out how to police such a level of code use.

So for better or worse, my consideration of the use of Boost has to be on a Module by Module basis.

4) In response to "the license says that it's free to use, and the copyright holders have agreed to that license, so everything is fine".

That's not true in the legal world. Neither the license nor any statements made by the person stating a copyright mean anything if that person somehow, if intentionally or unintentionally, included some bit of someone else's code in what they are calling their own. If the original writer of the code can prove original authorship of the code, nothing done without THAT PERSON'S GRANT OF LICENSE means anything. That original author owns all rights to the use of that code, and can dictate how it can and cannot be used. It is this issue that concerns companies like mine.

5) In response to "Who cares how much code there is. How does one "vet" a piece of code, regardless of how much of it there is".

It is not hard to look at 100 or 1000 lines of code in a few files and say "there's nothing novel here". If the code is all written to do one basic thing or set of things in a direct way, it's pretty easy to believe that a single or a few individuals wrote the code. And, if the claimed authorship is invalid, real damages would easy to justify as being minimal, given the very limited scope of what the code is capable of doing.

It's also much easier to feel comfortable in the fact that many other developers are using these 1000 lines of code in their commercial products and haven't yet been sued over the use of some portion of it. And if/when one wants to upgrade to the next version of a module consisting of 1000 lines of code, it's pretty easy to see what was added/removed.

But in the case of boost, with hundreds and possibly thousands (with fuller adoption) of individual files involved, consisting of tens to hundreds of thousands of lines of code, you can't have any idea what you've got In fact, you can feel fairly confident that all of those lines of code are NOT NECESSARY in the basic sense to the benefit you wish to gain from the module in question. So you have to ask yourself "what more does all this code do?", and you certainly can't read and understand the purpose of every line of such a quantity of code to answer that question And the fact that there's so much of it, leads one to wonder "what novel things might be going on in that code to require so much of it"? I mean, 384 header files being read for a Smart Ptr library is pretty darn "novel" in and of itself.

Finally, there's mere statistics involved. If 1000 lines of code opens a company to a certain amount of negative exposure, 100,000 lines of code, one might argue, opens the company to 100 times as much exposure.

6) And...I'm not sure this question was asked specifically, but I'll ask it myself..."what are you so worried about".

Here's an example of what we're worried about....

Say we develop a tool for Disney to use on one of its feature length films. A month before the premier date of the film, someone takes Disney to court and claims that one of their production tools, the one we wrote, contains code that was stolen from them. Disney asks us to come to court to defend our use of that code.

In the case of 1000 lines, we can say exactly what we did to vet the use of the code, and state exactly what that code does not just for us but for anyone who might use it, pointing out that each of those users has a very clear idea of what the code does, what it's worth to them, and why they considered the copyright given by the supposed author to be valid.

For 100,000 lines of code we say, well the 1% of the code we use kinda/sort works by doing this, but it does that by going off and using bits and pieces of all these other files, and frankly, we couldn't take the time to understand what all that code is for, and therefore can not possibly have understood that the code contained something novel that might have been misrepresented as to its authorship for reason of personal gain on the part of the offending copyright grantor.

In the first case, maybe the judge puts some value on the 1000 lines of code, and because it's Disney, that number gets multiplied by 10X. It's still a small amount of money for Disney, so they pay the money and just decide never to do business with us again.

In the second case, the judge says "wow, there's a lot of code here. This is going to take a lot of time to work out the ramifications of, and to put a dollar amount on" and files an injunction against Disney releasing their film. This costs Disney many millions of dollars on everything they've set in motion in order to release the film, that will now all be wasted money. Disney sues us for all of that money. We, as a very small software company, talk to our lawyer, who tells us our best bet is to fold the company and go find jobs working for Google.

7) Use a more modern C++

Some of our customers are in the Operating System Stone Age. For example, I often develop on Fedora, but my code has to be able to compile and run on Red Hat 5. AND, we are often told exactly what compiler to use, and that compiler sometimes not open source, and in a few cases no longer supported. So solving these issues with newer compilers is not an option.

8) Conclusion

So, it DOES MATTER, IN A BIG WAY how much code I have to bring into my project's codebase to get SmartPtr capabilities. And even if it turned out that it didn't, my boss doesn't consider it worth the risk to make that call. He'd rather hire another programmer just to write a SmartPtr library, so that our project can stay on schedule and he can sleep at night, knowing his company isn't going to some day go "poof" due to a relaxed approach to using Open Source.

Some of our customers don't allow their engineering departments any access to code on the internet for this very reason. There are firewalls designed solely to look for and disallow anything that looks like significant code or other data from coming into the company walls. We have to justify our use of each specific piece of Open Source to EACH OF THESE COMPANIES before we can begin to supply them with anything. So another big issue for us is that as soon as we say "We use Boost", we are dismissed from consideration for a project. I bet this happens all the time.

Thanks All for all the interesting and valuable discourse! Take care!

Steve

PS) My company DOES already use the Boost Smart Ptr library. However, it uses a much earlier version of the library, one that depends on just a dozen or so headers. So I guess at one point individual Boost modules were more separable. Or maybe it was just generally smaller back when that module was adopted. So we DO already have and use Boost Smart Ptrs...we just don't have all the nifty new features in the latest and greatest, the most important of which is the ability to not require that the pointed to class be defined wherever a Smart Ptr is instantiated. I'm dying for that feature.

Robert Ramey

unread,

Sep 6, 2012, 2:11:36 AM9/6/12

to boost...@lists.boost.org

st...@parisgroup.net wrote:
> 3) In response to the suggestion to not use the convenience headers,
> like say "smart_ptr.hpp" as apposed to a header for an individual
> header type.
>
> It's bad enough to tell my programmers they can only use certain
> Boost modules. To tell them that they can only use certain parts of
> certain Boost modules just gets to be too much.

No it's not. shared_ptr and scoped_ptr are really 2 different things
and shouldn't be treated as one.

> If I have to run to my boss every time I want to use
> one new particular feature from a Boost module, it's not worth the
> effort. Nor would it be worth the overhead of figuring out how to
> police such a level of code use.
>
> So for better or worse, my consideration of the use of Boost has to
> be on a Module by Module basis.

Why not take your argument to it's logical conclusion. Make one
header - boost.hpp - and you've only got one module to worry about.

Either accept the code bloat or bring in only what you need.
You can't have it both ways.

> 4) In response to "the license says that it's free to use, and the
> copyright holders have agreed to that license, so everything is
> fine".

> 5) In response to "Who cares how much code there is. How does one
> "vet" a piece of code, regardless of how much of it there is".
>

Boost has done all it can to address the problems created by
the legal system. It's not really fair nor realistic for us to more
than we already have. For the large majority of companies
large and small, this has been enough

> 6) And...I'm not sure this question was asked specifically, but I'll
> ask it myself..."what are you so worried about".

...
lol

> 7) Use a more modern C++
>
> Some of our customers are in the Operating System Stone Age. For
> example, I often develop on Fedora, but my code has to be able to
> compile and run on Red Hat 5. AND, we are often told exactly what
> compiler to use, and that compiler sometimes not open source, and in
> a few cases no longer supported. So solving these issues with newer
> compilers is not an option.
>
> 8) Conclusion
>

> He'd rather hire another programmer just to write
> a SmartPtr library, so that our project can stay on schedule and he
> can sleep at night,

He won't be able to make his schedule if he writes his own library.
He has no idea how much effort it takes to do such a thing.
He'll go broke doing this.

> So another big issue for us is that as soon as
> we say "We use Boost", we are dismissed from consideration for a
> project. I bet this happens all the time.

lol - maybe. But fiddling with headers is not going to change that.

Just rewrite the stuff yourself without looking at boost code
or documentation.

> PS) My company DOES already use the Boost Smart Ptr library.

just use the standard library header <memory>. But of course that
doesn't address your legal concerns.

John Maddock

unread,

Sep 6, 2012, 4:19:39 AM9/6/12

to boost...@lists.boost.org

>> Of these things, the "expand-all" and bigger twiddle targets would
>> likely be among the easier, while the other two would require a lot
>> more work.
>
> These are in fact not too hard. I'll consider it.
>
> It turns out that I've been sucked in to spending time learning about
> XSLT, XML editors, DocBooK, BoostBook. etc. So I've
> considered re-formating the serialization documentation in terms
> of BoostBook which would address all he the above and give
> the serialization library documentation a more up date look
> compatible with the other boost libraries.
>
> BUT - I would lose the navigator - which I'm actually in love with.
>
> What I would really like is an XSLT script which would build
> a free-floating BoostBook Navigator which would navigate
> all the boost book docs. Of course it's usage would be optional.

Nice idea. Robert: the HTML in your navigator pane looks rather similar to
the HTML in generated for the Boostbook TOC page, could your scripts be
injected in without too much change? If so we would just need to tweak HTML
generation to produce an extra TOC page along with the frames etc?

BTW I really don't like those tiny graphics either.

Also just noted that docbook can output web help, which looks rather nice:
http://docbook.sourceforge.net/release/xsl/current/webhelp/docs/content/ch01.html
but appears to require Java client side?

John.

Kerry, Richard

unread,

Sep 6, 2012, 6:15:28 AM9/6/12

to boost...@lists.boost.org

> but appears to require Java client side?

I don't think that's so.

DocBook/WebHelp requires JavaScript on the client side.
I think it requires Java only for the build (ie building the search indexes).

Regards,
Richard.

PS.
BTW, the starting point should be :
http://docbook.sourceforge.net/release/xsl/current/webhelp/docs/content/index.html

(or http://docbook.sourceforge.net/release/xsl/current/webhelp/docs/index.html, which redirects to the above).

> Also just noted that docbook can output web help, which looks
> rather nice:
> http://docbook.sourceforge.net/release/xsl/current/webhelp/docs/content/ch01.html
> but appears to require Java client side?

Robert Ramey

unread,

Sep 6, 2012, 1:29:35 PM9/6/12

to boost...@lists.boost.org

Robert Ramey

unread,

Sep 6, 2012, 1:53:56 PM9/6/12

to boost...@lists.boost.org

John Maddock wrote:
>>> Of these things, the "expand-all" and bigger twiddle targets would
>>> likely be among the easier, while the other two would require a lot
>>> more work.
>>
>> These are in fact not too hard. I'll consider it.
>>
>> It turns out that I've been sucked in to spending time learning about
>> XSLT, XML editors, DocBooK, BoostBook. etc. So I've
>> considered re-formating the serialization documentation in terms
>> of BoostBook which would address all he the above and give
>> the serialization library documentation a more up date look
>> compatible with the other boost libraries.
>>
>> BUT - I would lose the navigator - which I'm actually in love with.
>>
>> What I would really like is an XSLT script which would build
>> a free-floating BoostBook Navigator which would navigate
>> all the boost book docs. Of course it's usage would be optional.
>
> Nice idea. Robert: the HTML in your navigator pane looks rather
> similar to the HTML in generated for the Boostbook TOC page, could
> your scripts be injected in without too much change? If so we would
> just need to tweak HTML generation to produce an extra TOC page along
> with the frames etc?

What is the "Boostbook TOC page"? is that the TOC for each library?

In any case what is needed is an XSLT script which would process
all the BoostBook files and result in an HTML file which would
contain the navigator. This Navigator would include some boilerplate
javascript. Note that IO streams has a different (probably slicker)
implication of the navigator javascript.

a) It makes the navigator separate from the pages and doesn't
occupy any significant storage.
b) It's usage would be optional.
c) It preserves theURL of the origninal pages
d) It's cross platform
e) It lets us add a couple of cool simple features like opening
pages in other tabs.

I'm currently bogged down in trying to implement my boost
candidate/incubator concept I proosed at boost con a
couple of years ago. I is in the course of this that I've become
familiar with the pros/cons of docbook/boostbook.

> BTW I really don't like those tiny graphics either.

Halleluhuh - at last something easy to fix!!!

> Also just noted that docbook can output web help, which looks rather
> nice:
> http://docbook.sourceforge.net/release/xsl/current/webhelp/docs/content/ch01.html
> but appears to require Java client side?

Hmm - worthy of investigation. Any such Navigator is goint to require
Javascript.
I don't see this is a downside.

a) Javascript is pretty ubiquitous
b) The usage of the "navigator" is optional
c) It doesn't take away anything we already have.

My little experience with XSLT and DocBook suggests that, like everything
else, there's more to it than meets the eye. But only trying it can we
know.

Robert Ramey

Charles Mills

unread,

Sep 6, 2012, 1:21:41 PM9/6/12

to boost...@lists.boost.org

Thanks, Steve, for your EXCELLENT exposition in point 6 of the issues involved.

Are folks familiar with http://www.blackducksoftware.com/protex ? (I have no interest in Black Duck and have not myself ever used their products or services.)

The “offending” code unfortunately does not even have to come from the Internet:

Jones is working on a software project. He engages his buddy Smith to write portions of the code on a handshake sub-contractor basis. Jones subsequently contributes some of the code to Boost or another open source project, with all of the proper paperwork. Smith probably has a copyright claim on any code that uses the open source project.

Smith is a nice guy and told Jones “he would never sue anybody” but when he sees the name Disney the cash register in his mind goes ca-ching! He rationalizes suing on the basis that Disney (or fill in your favorite corporation) is part of the evil empire.

Charles

From: boost-use...@lists.boost.org [mailto:boost-use...@lists.boost.org] On Behalf Of st...@parisgroup.net
Sent: Wednesday, September 05, 2012 8:05 PM
To: boost...@lists.boost.org
Subject: Re: [Boost-users] Why is there so much co-dependency in Boost? Is there anything to be done about it?

Hi again,

Alex Perry

unread,

Sep 6, 2012, 1:40:26 PM9/6/12

to boost...@lists.boost.org

On 06 September 2012 18:54 Robert Ramey [mailto:ra...@rrsd.com] wrote :-

>
> Hmm - worthy of investigation. Any such Navigator is goint to require
> Javascript.
> I don't see this is a downside.
>
> a) Javascript is pretty ubiquitous
> b) The usage of the "navigator" is optional
> c) It doesn't take away anything we already have.
>

According to http://www.boost.org/development/requirements.html#Documentation
" The format for documentation should be HTML, and should not require an advanced browser or server-side extensions. Style sheets are acceptable. ECMAScript/JavaScript is not acceptable."

Not sure quite why such a explicit refusal of javascript but I imagine there were (at least) good reasons for this so may be worth some discussion to see if the reasons are still extant before doing any work.

Alex

Robert Ramey

unread,

Sep 6, 2012, 3:06:14 PM9/6/12

to boost...@lists.boost.org

Alex Perry wrote:
> On 06 September 2012 18:54 Robert Ramey [mailto:ra...@rrsd.com] wrote

> According to
> http://www.boost.org/development/requirements.html#Documentation "
> The format for documentation should be HTML, and should not require
> an advanced browser or server-side extensions. Style sheets are
> acceptable. ECMAScript/JavaScript is not acceptable."
>
> Not sure quite why such a explicit refusal of javascript but I
> imagine there were (at least) good reasons for this so may be worth
> some discussion to see if the reasons are still extant before doing
> any work.

That is/was the original policy from many years ago. In the case of the
navigators for serialzation and io streams, no one has requested
enforcement of this policy.

Basically this should be modified to permit javascript compatible
with know browsers.

BUT - if the "navigator" is factored out as a seperate, optional,
boost book compatible component, the original policy isn't
violated any more as the original boost documentation
web pages wouldn't contain any javascript. Maybe another
reason why this would be a good idea.

Robert Ramey

Sebastian Redl

unread,

Sep 7, 2012, 2:59:36 AM9/7/12

to boost...@lists.boost.org

On 06.09.2012 19:21, Charles Mills wrote:

The “offending” code unfortunately does not even have to come from the Internet:

Jones is working on a software project. He engages his buddy Smith to write portions of the code on a handshake sub-contractor basis. Jones subsequently contributes some of the code to Boost or another open source project, with all of the proper paperwork.

And all of the proper style normalization.

Smith probably has a copyright claim on any code that uses the open source project.

Smith is a nice guy and told Jones “he would never sue anybody” but when he sees the name Disney the cash register in his mind goes ca-ching! He rationalizes suing on the basis that Disney (or fill in your favorite corporation) is part of the evil empire.

I still wonder how code review, even of a thousand lines of code, is going to make a difference here. When Jones submitted his thing to Boost (or any other open source project), he probably went over it and made small changes to conform to style guidelines. Maybe Jones and Smith knew each other from college and had the same teachers and worked on projects together, so their coding styles are very similar in the first place. How is any reviewer (who doesn't even know Smith exists) going to notice that the code isn't all Jones's to begin with?

I'm not asking this just to be ornery. I'm actually genuinely curious if somebody thinks there is any technical merit to this process, or if it's all just legal window dressing.

On another legal note, wouldn't Jones be liable if he published code that isn't his under an open-source license? The license says that the author cannot be held liable for any damages coming from using the code, but in this case, Jones doesn't have the copyright, so he can't release the code under the license, so the license wouldn't apply.
So I wonder if a company that got sued over code that is in Boost but wasn't properly released by the actual author but instead by someone else could claim that they used the code in good faith and forward the blame to whoever released it.
(Which probably wouldn't help Disney much if their release is delayed, but it might make Smith think twice about suing if doing so would get Jones into trouble.)

Sebastian

Nevin Liber

unread,

Sep 7, 2012, 10:30:12 AM9/7/12

to boost...@lists.boost.org

On 5 September 2012 22:05, st...@parisgroup.net <st...@parisgroup.net> wrote:

Here's an example of what we're worried about....

Say we develop a tool for Disney to use on one of its feature length films. A month before the premier date of the film, someone takes Disney to court and claims that one of their production tools, the one we wrote, contains code that was stolen from them. Disney asks us to come to court to defend our use of that code.

How are you vetting your compiler, OS, etc.?

--
Nevin ":-)" Liber <mailto:ne...@eviloverlord.com> (847) 691-1404

Chris Cleeland

unread,

Sep 7, 2012, 11:17:36 AM9/7/12

to boost...@lists.boost.org

On Fri, Sep 7, 2012 at 9:30 AM, Nevin Liber <ne...@eviloverlord.com> wrote:

On 5 September 2012 22:05, st...@parisgroup.net <st...@parisgroup.net> wrote:

Here's an example of what we're worried about....

Say we develop a tool for Disney to use on one of its feature length films. A month before the premier date of the film, someone takes Disney to court and claims that one of their production tools, the one we wrote, contains code that was stolen from them. Disney asks us to come to court to defend our use of that code.

How are you vetting your compiler, OS, etc.?

Correct. One might be tempted to say, "that's absurd", but given the level of concern raised over using a toolkit that is clearly licensed and clearly copyrighted, this seems a legitimate concern if one is using a compiler such as gcc. Or an OS like Linux.

For that matter, what's to prevent a similar doomsday scenario from occurring if you were to purchase a license to use a 3rd party library, and that 3rd party library is found to contain infringing code? Where does tort law stop? It would seem that the infringement would be on the 3rd party and not on good-faith users of the 3rd party code.

IANAL, but I would think that whatever would apply in the 3rd party purchased software situation would similarly apply with boost--as long as your usage model is the same, i.e., you treat it as a 3rd party product and do not copy code from it. Once you copy the code, YOU would potentially be infringing.

Of course, the reality is that it may cost someone $$$$$ in legal bills to find out that you are right and the complainer is wrong, and a settlement would have been cheaper.

--
Chris Cleeland

Robert Ramey

unread,

Sep 7, 2012, 1:14:28 PM9/7/12

to boost...@lists.boost.org

Does anyone know of any of the following in europe or the US/Canada?

a) Has anyone been sued for incorporating Boost libraries in any software
product?
b) If the answer to the above is yes, has anyone ever won such a suit?
c) If the answer to the above is yes, has anyone ever had to pay damages?
d) If the answer to the above is yes, how much were the damages?

Robert Ramey

Charles Mills

unread,

Sep 7, 2012, 12:51:35 PM9/7/12

to boost...@lists.boost.org

The difference with a purchased product is that the license usually has some copyright and patent indemnification in it. I’m not being down on open source, it’s just a fact that one of the things you get for your license dollars is IP indemnification. What’s it worth? Your mileage may vary.

What about Linux? Well, Disney (in our example) is probably getting Linux directly from Red Hat or SUSE, not from the Boost-using software vendor, so it’s not his problem. (Yes, there IS a potential problem, as we saw with SCO v IBM.)

Yes, in my earlier note, isn’t Jones in trouble for representing the code as his own? Possibly, but that does not do Disney a whole lot of good. If he is a legally unsophisticated contract programmer he would have a good faith defense: “I paid Smith for the work and so I assumed I owned what he wrote.”

The key is in the last paragraph below. Being able to say “well, what about this factor? Doesn’t that change everything” doesn’t change anything. Disney is still going to have to pay a law firm big bucks to sort it all out, and it likely means the death of the Boost-using Disney software vendor. Again, I’m not trying to bad-mouth Boost or open source. I’m just reciting the sad facts of life in the big city in 2012.

> It would seem that the infringement would be on the 3rd party and not on good-faith users of the 3rd party code.

A safe harbor shield law would be a wonderful thing for the small and open source software community.

Charles

From: boost-use...@lists.boost.org [mailto:boost-use...@lists.boost.org] On Behalf Of Chris Cleeland
Sent: Friday, September 07, 2012 8:18 AM
To: boost...@lists.boost.org
Subject: Re: [Boost-users] Why is there so much co-dependency in Boost? Is there anything to be done about it?

On Fri, Sep 7, 2012 at 9:30 AM, Nevin Liber <ne...@eviloverlord.com> wrote:

Charles Mills

unread,

Sep 7, 2012, 12:54:18 PM9/7/12

to boost...@lists.boost.org

Has anyone read "The Black Swan"?

How many mortgage crashes have there ever been?

Charles

-----Original Message-----
From: boost-use...@lists.boost.org
[mailto:boost-use...@lists.boost.org] On Behalf Of Robert Ramey
Sent: Friday, September 07, 2012 10:14 AM
To: boost...@lists.boost.org
Subject: Re: [Boost-users] Why is there so much co-dependency in Boost? Is
there anything to be done about it?

Does anyone know of any of the following in europe or the US/Canada?

a) Has anyone been sued for incorporating Boost libraries in any software
product?
b) If the answer to the above is yes, has anyone ever won such a suit?
c) If the answer to the above is yes, has anyone ever had to pay damages?
d) If the answer to the above is yes, how much were the damages?

Chris Cleeland

unread,

Sep 7, 2012, 1:34:57 PM9/7/12

to boost...@lists.boost.org

On Fri, Sep 7, 2012 at 11:51 AM, Charles Mills <char...@mcn.org> wrote:

> It would seem that the infringement would be on the 3rd party and not on good-faith users of the 3rd party code.

A safe harbor shield law would be a wonderful thing for the small and open source software community.

Sounds like something a lobbying organization could take up. Any former software developers in congress?

--
Chris Cleeland

Charles Mills

unread,

Sep 7, 2012, 1:44:48 PM9/7/12

to boost...@lists.boost.org

> Any former software developers in congress?

Nope, all liability lawyers. Are you starting to see a thread here?

Charles

From: boost-use...@lists.boost.org [mailto:boost-use...@lists.boost.org] On Behalf Of Chris Cleeland
Sent: Friday, September 07, 2012 10:35 AM
To: boost...@lists.boost.org
Subject: Re: [Boost-users] Why is there so much co-dependency in Boost? Is there anything to be done about it?

On Fri, Sep 7, 2012 at 11:51 AM, Charles Mills <char...@mcn.org> wrote:

--
Chris Cleeland

Nevin Liber

unread,

Sep 7, 2012, 1:56:08 PM9/7/12

to boost...@lists.boost.org

On 7 September 2012 11:51, Charles Mills <char...@mcn.org> wrote:

What about Linux? Well, Disney (in our example) is probably getting Linux directly from Red Hat or SUSE, not from the Boost-using software vendor, so it’s not his problem. (Yes, there IS a potential problem, as we saw with SCO v IBM.)

Is Boost part of their distribution? If so, problem solved.

Reply all

Reply to author

Forward