A design problem I met again and again.

一首诗

Apr 1, 2009, 3:44:01 AM

Hi all,

I am a programmer who works with several different programming
languages, such as Python, C++ (in COM), ActionScript, C#, etc.

Today I realized that, whatever language I use, I always run into the
same problem, and I don't think I have ever solved it very well.

The problem is: how do I break my app into functional pieces?

I know it's important to break an application into lots of pieces to
make it flexible. But it's easier said than done. I can split an
application into 4 or 5 pieces based on "programming functions", for
example, logging, socket, string, math, ...

When it comes to the business logic, I find I always end up with one
big class with many methods, and it grows bigger whenever new functions
are added.

Recently I used Twisted to write a server. It has several protocol
classes, which decode and encode different kinds of network protocols,
and a protocol-independent service class, which handles requests from
clients according to the business logic.

The protocol classes receive a message from a client, decode it, call a
method of the service, encode the result, and send it back to the
client.
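
A minimal sketch of the layout described above, in plain Python so it
runs without Twisted. The names Service, JsonProtocol, register and the
JSON message shape are all assumptions made for illustration; they are
not taken from the original server.

```python
import json


class Service:
    """Protocol-independent business logic (hypothetical example)."""

    def __init__(self):
        self._users = {}

    def register(self, name):
        self._users[name] = {"name": name}
        return {"ok": True, "count": len(self._users)}


class JsonProtocol:
    """Decodes a JSON request, calls the service, encodes the reply."""

    def __init__(self, service):
        self.service = service

    def handle(self, raw: bytes) -> bytes:
        request = json.loads(raw.decode("utf-8"))          # decode
        method = getattr(self.service, request["method"])  # find service method
        result = method(*request.get("args", []))          # call it
        return json.dumps(result).encode("utf-8")          # encode the result


protocol = JsonProtocol(Service())
reply = protocol.handle(b'{"method": "register", "args": ["alice"]}')
```

Every protocol class would follow the same decode, call, encode shape,
which is why all the business methods tend to pile up in the one
service class.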

There are also some utility packages, such as logging, as I mentioned
before.

So far so good; everything is clear.

Until one day I found that the service had nearly 100 methods and 6000
lines of code. I don't need to read any programming book to know that
it's too big.

But I cannot find an easy way to split it. Here are some solutions I
have come up with:

1. Add several business classes and move the code in the service into
them. But this means that, although the service will contain much less
code, it still has to keep lots of methods whose only function is to
call the corresponding methods in the business classes. The number of
methods in the service will keep growing forever.

2. Move the code in the service completely into business classes
containing only classmethods. The protocol classes call these
classmethods directly instead of calling the service. But this pattern
doesn't look very OO.

3. Move the code in the service completely into business classes.
Instantiate these classes and pass the instances to the protocol
classes. The protocol classes call these instances instead of calling
the service. This means that whenever I add a new business class, I
have to add a parameter to the __init__ method of every protocol
class. Not very clear either.
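
The three options can be sketched side by side as follows. This is only
an illustration under assumed names (AccountLogic, OrderLogic,
AccountOps, Service, Protocol); none of them come from the real code.

```python
class AccountLogic:
    """Hypothetical business class."""

    def create(self, name):
        return f"account:{name}"


class OrderLogic:
    """Hypothetical business class."""

    def place(self, item):
        return f"order:{item}"


# Option 1: the service keeps one forwarding method per business method,
# so its method count still grows with every new feature.
class Service:
    def __init__(self):
        self._accounts = AccountLogic()
        self._orders = OrderLogic()

    def create_account(self, name):
        return self._accounts.create(name)

    def place_order(self, item):
        return self._orders.place(item)


# Option 2: business classes made of classmethods only, called directly
# by the protocol classes without any instance.
class AccountOps:
    @classmethod
    def create(cls, name):
        return f"account:{name}"


# Option 3: each protocol class receives every business object explicitly,
# so a new business class means a new __init__ parameter here.
class Protocol:
    def __init__(self, accounts, orders):
        self.accounts = accounts
        self.orders = orders

    def on_create_account(self, name):
        return self.accounts.create(name)


service = Service()
protocol = Protocol(AccountLogic(), OrderLogic())
```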

==========================================

I run into the same problem in C# and C++ when I have to provide a lot
of methods to the users of my code. So I create a big class as the
entry point of my code. Although these big classes don't contain much
logic, they do grow bigger and bigger.

Lawrence D'Oliveiro

Apr 1, 2009, 4:55:23 AM

In message <48506803-a6b9-432b-acef-b75f76...@v23g2000pro.googlegroups.com>,
一首诗 wrote:

> Until one day I found that the service had nearly 100 methods and 6000
> lines of code. I don't need to read any programming book to know that
> it's too big.

The question is not how many lines or how many methods, but whether it makes
sense to remain as one piece or not. In one previous project, I had one
source file with nearly 15,000 lines in it. Did it make sense to split that
up? Not really.

andrew cooke

Apr 1, 2009, 6:40:02 AM

to newp...@gmail.com, pytho...@python.org

一首诗 wrote:
> 3. Move the code in the service completely into business classes.
> Instantiate these classes and pass the instances to the protocol
> classes. The protocol classes call these instances instead of calling
> the service. This means that whenever I add a new business class, I
> have to add a parameter to the __init__ method of every protocol
> class. Not very clear either.

i don't fully understand your problem, but i would guess (3) is the
correct solution. you can probably avoid adding a new parameter by
writing code in a generic way (using lists of arguments, perhaps using
introspection to find method names, etc)
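
One possible reading of "a generic way", as a sketch only: the protocol
classes take a single dispatcher, and the dispatcher finds public
method names on the business objects by introspection. Dispatcher,
AccountLogic, OrderLogic and Protocol are hypothetical names chosen for
this example.

```python
class AccountLogic:
    def create_account(self, name):
        return f"account:{name}"


class OrderLogic:
    def place_order(self, item):
        return f"order:{item}"


class Dispatcher:
    """Maps public method names to bound methods found by introspection."""

    def __init__(self, *business_objects):
        self._handlers = {}
        for obj in business_objects:
            for name in dir(obj):
                if name.startswith("_"):
                    continue  # skip private and special attributes
                attr = getattr(obj, name)
                if callable(attr):
                    if name in self._handlers:
                        raise ValueError(f"duplicate handler name: {name}")
                    self._handlers[name] = attr

    def call(self, name, *args):
        return self._handlers[name](*args)


class Protocol:
    """Takes one dispatcher, however many business classes exist."""

    def __init__(self, dispatcher):
        self.dispatcher = dispatcher

    def handle(self, method_name, *args):
        return self.dispatcher.call(method_name, *args)


protocol = Protocol(Dispatcher(AccountLogic(), OrderLogic()))
```

Adding a new business class then only changes the line that builds the
Dispatcher; the __init__ of each protocol class stays the same.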

andrew

一首诗

Apr 1, 2009, 10:38:43 AM

I also think that's my best choice. Before I wrote my mail, I
already knew that this was not a good question. It lacks details, and
it is too big.

But I think the first step in resolving a problem is to describe it.
That way, I might find the answer myself.

On Apr 1, 6:40 pm, "andrew cooke" <and...@acooke.org> wrote:

一首诗

Apr 1, 2009, 10:40:40 AM

On Apr 1, 4:55 pm, Lawrence D'Oliveiro
<l...@geek-central.gen.new_zealand> wrote:
> In message <48506803-a6b9-432b-acef-

What is the average size of the source files in your project? If it's
far lower than 15,000 lines, don't you feel that's a little unbalanced?

Nick Craig-Wood

Apr 1, 2009, 2:30:05 PM

一首诗 <newp...@gmail.com> wrote:
> But I think the first step in resolving a problem is to describe it.
> That way, I might find the answer myself.

:-) That is a great saying!

To answer your original question, split your code up into sections
that can be tested independently. If you can test code in an isolated
way then it belongs in a class / module of its own.

If you have a class that is too big, then factor independent classes
out of it until it is the right size. That is easier said than done
and may require some creativity on your part. It will pay dividends
though as the level of abstraction in your program will rise.
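
As a small sketch of that advice, assume a pricing calculation had been
buried inside the big service class. PriceCalculator, Service and
checkout are hypothetical names; the point is only that the factored-out
class can be tested without any service or protocol objects.

```python
import unittest


class PriceCalculator:
    """Hypothetical piece factored out of a large service class."""

    def __init__(self, tax_rate):
        self.tax_rate = tax_rate

    def total(self, net_prices):
        net = sum(net_prices)
        return round(net * (1 + self.tax_rate), 2)


class Service:
    """The service now delegates pricing instead of implementing it."""

    def __init__(self, calculator):
        self.calculator = calculator

    def checkout(self, net_prices):
        return {"total": self.calculator.total(net_prices)}


class PriceCalculatorTest(unittest.TestCase):
    # No service, network, or protocol objects are needed here.
    def test_total_includes_tax(self):
        self.assertEqual(PriceCalculator(0.5).total([10, 20]), 45.0)
```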

I've noticed some programmers think in big classes and some think in
small classes. Train yourself to do the other thing and your
programming will improve greatly!

--
Nick Craig-Wood <ni...@craig-wood.com> -- http://www.craig-wood.com/nick

Martin P. Hellwig

Apr 1, 2009, 3:14:59 PM

一首诗 wrote:
<cut>

> But I think the first step in resolving a problem is to describe it.
> That way, I might find the answer myself.

<cut>
That is an excellent approach. Knowing you have a problem and describing
it is actually the hardest part of a design; the rest is more like a
puzzle.

What I guess so far is that you tried to (re)design your work by
grouping on functionality and by using classes to make the work clearer.
From what you wrote (that is, if I understood you correctly), neither of
these approaches really seems to get you 'there'.

It might be worth trying another approach. Instead of focussing on the
characteristics of the functions and using them as a guideline for your
design, you could try this:

Step 1:
Write a Functional Design from a user perspective, restrain yourself
from implying anything technical or choosing specific tools. Imagine
yourself as an end-user and not as a developer.

Pick a random person off the street who looks literate but is not
working in IT (secretaries are usually great for this!), let them
comment on your language and then quiz them about the content to see if
they actually understood what you wrote.

If commenting on language seems strange: in my experience, if I can't
properly describe what I want to say, then there is a good chance that I
haven't thought about it sufficiently or I was lazy in describing it.

Step 2:
Take this functional design and write a functional specification.
This is much like the design but instead focusses on the business
processes and their interdependencies. Write out implied constraints
and things you might think are obvious. Although the specification is
technical in nature, you should still avoid naming specific tools unless
it is to describe functionality, e.g. a Google-like approach to indexing
data. Use plain English (or whatever language you want to write it in)
for this; don't use any diagrams, SQL table layouts, UML etc.

Pick a random IT-related colleague (network administrators are usually
my preferred choice), let them read it and quiz them to make sure the
specification is clear enough.

Step 3:
When you have your functional specification, write a technical design.
Here you choose the tools you are going to use, based on evidence-based
research, and describe the general outline of your solution.

Pour your co-worker a nice cup of beverage of their choice and let them
read it and of course quiz them.

Step 4:
Finally, use the technical design for writing a technical specification.
Design your program using UML (or whatever makes you look like you are
developing without writing code). Specify deeply, down to the names of
all 'public' functions.

Step 5:
Let it rest for the weekend.

Step 6:
Reread your technical specification, if it still makes sense, continue.
If it doesn't, go back to step 1 and repeat the process with the changes
you made.

Step 7:
Do what you usually do (I write my unit-tests first and then solve them).

Step 8:
Compare the end product with your original functional design.
If they do not align go back to Step 1.
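
To make steps 4 and 7 concrete, here is a tiny test-first sketch. The
function slugify and its one-line specification are invented for this
example: the specification names the public function, the tests are
written from that specification, and the implementation comes last.

```python
import unittest


# Step 4 (hypothetical specification): public function slugify(title) -> str
# lower-cases the title and joins its words with single hyphens.


# Step 7: the tests are written first, from the specification alone.
class SlugifyTest(unittest.TestCase):
    def test_words_joined_with_hyphens(self):
        self.assertEqual(slugify("Hello  Big World"), "hello-big-world")

    def test_empty_title(self):
        self.assertEqual(slugify(""), "")


# Then the implementation is written to make the tests pass.
def slugify(title):
    return "-".join(title.lower().split())
```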

Some hints I have found useful during step 4: I try to take into
account that it is not me who is going to develop it but a team of
reasonably qualified developers. Thus I split up the work into parts
that can be done simultaneously by more than one person without them
needing to know exactly what the other one is doing. If there is a need
to know what the other developer is doing, then the specification was
not precise enough.

If during the whole process something comes up that shows a better way,
change your documentation accordingly.

When all of this still results in an 'ugly' design, try letting more
people read your documentation. If that doesn't help, then one or more
of the following may apply:
- Despite its ugliness, it is the most elegant design possible.
- You are working on something that is fundamentally broken.
- You haven't met the person that can give you more insight.

YMMV
--
mph

Carl Banks

Apr 1, 2009, 5:58:55 PM

On Apr 1, 12:44 am, 一首诗 <newpt...@gmail.com> wrote:
> I run into the same problem in C# and C++ when I have to provide a lot
> of methods to the users of my code. So I create a big class as the
> entry point of my code. Although these big classes don't contain much
> logic, they do grow bigger and bigger.

This seems to be a classic result of "code-based organization", that
is, you are organizing your code according to how your functions are
used. That's appropriate sometimes. Procedural libraries are often
organized by grouping functions according to use. The os module is a
good example.

However, it's usually much better to organize code according to what
data it acts upon: "data-based organization". In other words, go
through your big class and figure out what data belongs together
conceptually, make a class for each conceptual set of data, then
assign methods to classes based on what data the methods act upon.

Consider the os module again. It's a big collection of functions, but
there is a group of functions in os that all act on a particular
piece of data, namely a file descriptor. This suggests that all the
functions that act upon file descriptors (os.open, os.close, os.lseek,
etc.) could instead be methods of a single class, with the file
descriptor as a class member.

(Note: the os library doesn't do that because functions like os.open
are supposed to represent low-level operations corresponding to the
underlying system calls, but never mind that. Ordinarily a bunch of
functions operating on common data should be organized as a class.)
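
A rough sketch of what such a class could look like. FileDescriptor is
a hypothetical wrapper written for this example, not something the os
module provides; it uses os.open, os.write, os.read, os.lseek (the
actual name of the seek call in os) and os.close.

```python
import os
import tempfile


class FileDescriptor:
    """Groups the os functions that act on one file descriptor."""

    def __init__(self, path, flags, mode=0o600):
        self.fd = os.open(path, flags, mode)

    def write(self, data: bytes) -> int:
        return os.write(self.fd, data)

    def seek(self, offset, whence=os.SEEK_SET) -> int:
        return os.lseek(self.fd, offset, whence)

    def read(self, count: int) -> bytes:
        return os.read(self.fd, count)

    def close(self):
        os.close(self.fd)


# Usage: write, rewind, and read back through the same object.
path = os.path.join(tempfile.mkdtemp(), "demo.bin")
f = FileDescriptor(path, os.O_RDWR | os.O_CREAT)
f.write(b"hello")
f.seek(0)
data = f.read(5)
f.close()
```

The file descriptor is the shared data, and every method is one of the
functions that used to take it as a first argument.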

Carl Banks

Lawrence D'Oliveiro

Apr 2, 2009, 1:47:29 AM

In message <158986a9-b2d2-413e-9ca0-c58429...@f1g2000prb.googlegroups.com>,
一首诗 wrote:

Why?

Steven D'Aprano

Apr 2, 2009, 2:28:04 AM

On Thu, 02 Apr 2009 18:47:29 +1300, Lawrence D'Oliveiro wrote:

>>> The question is not how many lines or how many methods, but whether it
>>> makes sense to remain as one piece or not. In one previous project, I
>>> had one source file with nearly 15,000 lines in it. Did it make sense
>>> to split that up? Not really.
>>
>> What is the average size of the source files in your project? If it's
>> far lower than 15,000 lines, don't you feel that's a little unbalanced?
>
> Why?

If you have too much code in one file, it will upset the balance of the
spinning hard drive platter, and it will start to wobble and maybe even
cause a head-crash.

--
Steven

Martin P. Hellwig

Apr 2, 2009, 4:50:33 AM

Steven D'Aprano wrote:
<cut>

> If you have too much code in one file, it will upset the balance of the
> spinning hard drive platter, and it will start to wobble and maybe even
> cause a head-crash.
>

That is why properly designed operating systems, like Windows 95, rarely
write one continuous block but spread the file all over the HD.

--
mph

Tim Rowe

Apr 2, 2009, 6:01:48 AM

to pytho...@python.org

2009/4/1 一首诗 <newp...@gmail.com>:

> Hi all,
>
> I am a programmer who works with several different programming
> languages, such as Python, C++ (in COM), ActionScript, C#, etc.
>
> Today I realized that, whatever language I use, I always run into the
> same problem, and I don't think I have ever solved it very well.
>
> The problem is: how do I break my app into functional pieces?

One approach is to go through the specification of the program,
underline all of the significant nouns and try to implement each of
the nouns as a class. That won't take you all the way to a good design
-- some of the resulting classes will be too trivial, and it won't
give you the derived classes you need, but it's a good first step to
breaking a problem down, and might help break your one big class
habit.
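
As an illustration, take an invented one-line specification: "A
customer places an order; an order contains items, each with a price."
The nouns customer, order and item become classes. Everything in this
sketch, including the specification itself, is an assumption made for
the example.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    name: str
    price: float


@dataclass
class Order:
    items: list = field(default_factory=list)

    def add(self, item):
        self.items.append(item)

    def total(self):
        return sum(item.price for item in self.items)


@dataclass
class Customer:
    name: str
    orders: list = field(default_factory=list)

    def place_order(self, order):
        self.orders.append(order)
        return order


order = Order()
order.add(Item("book", 12.5))
order.add(Item("pen", 1.5))
customer = Customer("alice")
customer.place_order(order)
```

Item is close to trivial here, which matches the caveat above: the noun
list is a starting point, and some of the resulting classes may later
be merged or dropped.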

--
Tim Rowe