Development/Tutorials/Common Programming Mistakes

    From KDE TechBase
    Revision as of 13:31, 15 April 2007 by Tampakrap (talk | contribs) (→‎Anonymous namespaces vs statics: typo: it's "a unique" since its pronounciation starts with a "j". :))
    Common Programming Mistakes
    Tutorial Series   Getting Started
    Previous   None
    What's Next   n/a
    Further Reading   n/a

    Abstract

    This tutorial aims to combine the experience of KDE developers regarding Qt and KDE frameworks dos and don'ts. Besides actual mistakes, it also covers things which are not necessarily "bugs" but which make the code either slower or less readable.

    General C++

    This section guides you through some of the more dusty corners of C++ which either tend to be misused or which people often simply get wrong.

    Anonymous namespaces vs statics

    If you have a method in a class that does not access any members and therefore does not need an object to operate, make it static. If additionally it is a private helper function that is not needed outside of the file, make it a file-static function. That hides the symbol completely.

    Symbols defined in a C++ anonymous namespace do not have internal linkage. Anonymous namespaces only give a unique name for that translation unit and that is it; they don't change the linkage of the symbol at all. Linkage isn't changed on those because the second phase of two-phase name lookup ignores functions with internal linkages. Also, entities with internal linkage cannot be used as template arguments.

    So for now instead of using anonymous namespaces use static if you don't want a symbol to be exported.

    NULL pointer issues

    First and foremost: it is fine to delete a null pointer. So constructs like this that check for null before deleting are simply redundant:

    if ( ptr ) {

      delete ptr;
    

    }

    When you delete a pointer, make sure you also set it to 0 so that future attempts to delete that object will not fail in a double delete. So the complete and proper idiom is:

    delete ptr; ptr = 0;

    You many notice that null pointers are marked variously in one of three ways: 0, 0L and NULL. The argument against using NULL was that while C defines it as a 0 void pointer, C++ defines it to not be a 0 void pointer. All conforming C++ implementations will define NULL correctly so it's really not a problem. The argument for 0L was that it was handled correctly in variable argument functions, while 0 wasn't. Nowadays that's also an artifact.

    It's more a question of personal style and getting used to something. As far as the code in KDE's SVN goes you'll see 0 used more commonly than NULL.

    Member variables

    You'll encounter four major styles of marking class member variables in KDE:

    • m_variable lowercase m, underscore and the name of the variable starting with a lowercase letter. This is the most common style and one prefered for code in kdelibs.
    • mVariable lowercase m and the name of variable starting with a uppercase letter
    • variable_ variable name starting with a lowercase letter and then an underscore
    • _variable underscore and the name of variable starting with a lowercase letter. This style is actually usually frowned upon as this notation is also used in some code for function parameters instead.

    As it often happens there's no one correct way of doing it, so remember to always follow the syntax used by the application/library to which you are committing.

    Static variables

    Try to limit the number of static variables used in your code, especially when committing to a library. Construction and initialization of large number of static variables really hurts the startup times.

    Do not use class-static variables, especially not in libraries and loadable modules though it is even discouraged in applications. Static objects lead to lots of problems such as hard to debug crashes due to undefined order of construction/destruction.

    Instead, use a static pointer, together with K_GLOBAL_STATIC which is defined in kglobal.h and is used like this:

    class A { ... };

    K_GLOBAL_STATIC(A, globalA)

    void doSomething() {

        A *a = globalA;
        ...
    

    }

    void doSomethingElse() {

       if (globalA.isDestroyed()) {
           return;
       }
       A *a = globalA;
       ...
    

    }

    void installPostRoutine() {

       qAddPostRoutine(globalA.destroy);
    

    }

    See the API documentation for K_GLOBAL_STATIC for more information.

    Forward Declarations

    You will reduce compile times by forward declaring classes when possible instead of including their respective headers. For example:

    1. include <QWidget> // slow
    2. include <QStringList> // slow
    3. include <QString> // slow

    class SomeInterface { public:

       virtual void widgetAction( QWidget *widget ) =0;
       virtual void stringAction( const QString& str ) =0;
       virtual void stringListAction( const QStringList& strList ) =0;
    

    };

    The above should instead be written like this:

    class QWidget; // fast class QStringList; // fast class QString; // fast class SomeInterface { public:

       virtual void widgetAction( QWidget *widget ) =0;
       virtual void stringAction( const QString& str ) =0;
       virtual void stringListAction( const QStringList& strList ) =0;
    

    };

    Iterators

    Prefer to use const_iterators over normal iterators when possible. Containers, which are being implicitly shared often detach when a call to a non-const begin() or end() methods is made (List is an example of such a container). When using a const_iterator also watch out that you're really calling the const version of begin() and end(). Unless your container is actually const itself this probably won't be the case, possibly causing an unnecessary detach of your container. So basically whenever you use const_iterator initialize them using constBegin()/constEnd() instead, to be on the safe side.

    Cache the return of the end() method call before doing iteration over large containers. For example:

    QValueList<SomeClass> container;

    //code which inserts a large number of elements to the container

    QValueListConstIterator end( container.end() );

    for ( QValueListConstIterator itr( container.begin() );

        itr != end; ++itr ) {
    

    }

    This avoids the unnecessary creation of the temporary end() return object on each loop iteration, largely speeding it up.

    Prefer to use pre-increment over post-increment operators on iterators as this avoids creating an unnecessary temporary object in the process.

    take care when erasing elements inside a loop

    When you want to erase some elements from the list, you maybe would use code similar to this:

    QMap<int, Job *>::iterator it = m_activeTimers.begin(); QMap<int, Job *>::iterator itEnd = m_activeTimers.end();

    for( ; it!=itEnd ; ++it ) {

       if(it.value() == job)
       {
           //A timer for this job has been found. Let's stop it.
           killTimer(it.key());
           m_activeTimers.erase(it);
       }
    

    }

    This code will potentially crash because it is a dangling iterator after the call to erase(). You have to rewrite the code this way: QMap<int, Job *>::iterator it = m_activeTimers.begin(); while (it != m_activeTimers.end()) {

       QMap<int, Job *>::iterator prev = it;
       ++it;
       if(prev.value() == job)
       {
           //A timer for this job has been found. Let's stop it.
           killTimer(prev.key());
           m_activeTimers.erase(prev);
       }
    

    } This problem is also discussed in the Qt documentation for QMap::iterator but applies to all Qt iterators

    Program Design

    In this section we'll go over some common problems related to the design of Qt/KDE applications.

    Delayed Initialization

    Although the design of modern C++ applications can be very complex, one recurring problem that is generally easy to fix is not using the technique of [delayed initialization].

    First, let's look at the standard way of initializing a KDE application:

    int main( int argc, char **argv ) {

       ....
       KApplication a;
    
       KCmdLineArgs *args = KCmdLineArgs::parsedArgs();
    
       MainWindow *window = new MainWindow( args );
    
       a.setMainWidget( window );
       window->show();
    
       return a.exec();
    

    }

    Notice that window is created before the a.exec() call that starts the event loop. This implies that we want to avoid doing anything non-trivial in the top-level constructor, since it runs before we can even show the window.

    The solution is simple: we need to delay the construction of anything besides the GUI until after the event loop has started. Here's how the example class MainWindow's constructor could look to achieve this:

    MainWindow::MainWindow() {

       initGUI();
       QTimer::singleShot( 0, this, SLOT(initObject()) );
    

    }

    void MainWindow::initGUI() {

       /* Construct your widgets here.  Note that the widgets you
        * construct here shouldn't require complex initialization
        * either, or you've defeated the purpose.
        * All you want to do is create your GUI objects and
        * QObject::connect
        * the appropriate signals to their slots.
        */
    

    }

    void MainWindow::initObject() {

       /* This slot will be called as soon as the event loop starts.
        * Put everything else that needs to be done, including
        * restoring values, reading files, session restoring, etc here.
        * It will still take time, but at least your window will be
        * on the screen, making your app look active.
        */
    

    }

    Using this technique may not buy you any overall time, but it makes your app seem quicker to the user who is starting it. This increased perceived responsiveness is reassuring for the user as they get quick feedback that the action of launching the app has succeeded.

    When (and only when) the start up can not be made reasonably fast enough consider using a KSplashScreen.

    Data Structures

    In this section we'll go over some our most common pet-peeves which affect data structures very commonly seen in Qt/KDE applications.

    Passing non-POD types

    Non "plain old data", or POD, types should be passed by const reference if at all possible. This includes anything other than the basic types such as char and int.

    Take, for instance, QString. They should always be passed into methods as const QString&. Even though QString is implicitly shared it is still more efficient (and safer) to pass const references as opposed to objects by value.

    So the canonical signature of a method taking QString arguments is:

    void myMethod( const QString & foo, const QString & bar );

    QObject

    If you ever need to delete a QObject derived class from within one of its own methods, don't ever delete it this way:

      delete this;
    

    This will sooner or later cause a crash because a method on that object might be invoked from the Qt event loop via slots/signals after you deleted it.

    Instead always use QObject::deleteLater() which tries to do the same thing as delete this but in a safer way.

    Empty QStrings

    It is common to want to test to see if a QString is empty. Here are three ways of doing it, the first two of which are correct:

    // Correct if ( mystring.isEmpty() ) { }

    // Correct if ( mystring == QString() ) { }

    // Wrong! "" if ( mystring == "" ) { }

    While there is a distinction between "null" QStrings and empty ones, this is a purely historical artifact and new code is discouraged from making use of it.

    QString and reading files

    If you're reading in a file it is faster to convert it from the local encoding to Unicode (QString) in one go rather than line by line. This means that methods like {qt|QIODevice}}::readAll() are often a good solution, followed by a single QString instantiation.

    For larger files consider reading a block of lines and then performing the conversion, that way you get the opportunity to update your GUI. This can be accomplished by reentering the event loop normally and using a timer to read in the blocks in the background or by creating a local event loop.

    While one can also use qApp->processEvents(), it is discouraged as it easily leads to subtle yet often fatal problems.

    QString and QByteArray

    While QString is the tool of choice for many string handling situations there is one where it is particularly inefficient. If you are pushing about and working on data in QByteArrays take care not to pass it through methods which take QString parameters and then make QByteArrays from them again.

    For example:

    QCString myData; QString myNewData = mangleData( myData );

    QString mangleData( const QString data ) {

       QCString str = data.toLatin1();
       // mangle 
       return QString(str);
    

    }

    The expensive thing happening here is the conversion to QString which does a conversion to Unicode internally. That is unnecessary as the first thing the method does is convert back using toLatin1(). So if you are sure that the Unicode conversion is not needed, try to avoid inadvertently using QString along the way.

    The above example should instead be written as:

    QCString myData; QCString myNewData = mangleData( myData );

    QCString mangleData( const QCString& data )

    QDomElement

    When parsing XML documents, one often needs to iterate over all the elements. You may be tempted to use the following code for that:

    for ( QDomElement e = baseElement.firstChild().toElement();

         !e.isNull();
         e = e.nextSibling().toElement() ) {
          ...
    

    }

    That is not correct though: the above loop will stop prematurely when it encounters a QDomNode that is something other than an element such as a comment.

    The correct loop looks like:

    for ( QDomNode n = baseElement.firstChild(); !n.isNull();

         n = n.nextSibling() ) {
       QDomElement e = n.toElement();
       if ( e.isNull() ) {
           continue;
       }
       ...
    

    }