A new and efficient form of Featherstone’s multibody divide and conquer algorithm (DCA) is presented and evaluated. The DCA was the first algorithm to achieve theoretically the optimal logarithmic time complexity with a theoretical minimum of parallel computer resources for general problems of multibody dynamics; however, the DCA is extremely inefficient in the presence of small to modest parallel computers. This alternative efficient DCA (DCAe) approach demonstrates that large DCA subsystems can be constructed using fast sequential techniques to realize a substantial increase in speed. The usefulness of the DCAe is directly demonstrated in an application to a four processor workstation and compared with the results from the original DCA and a fast sequential recursive method. Previously the DCA was a tool intended for a future generation of parallel computers; this enhanced version delivers practical and competitive performance with the parallel computers of today.