[ODE] Faster replacement for fastldlt.c

Henri Hakl henri at cs.sun.ac.za
Mon Nov 11 08:35:02 2002


This is a multi-part message in MIME format.

------=_NextPart_000_000A_01C289A8.C02B4F10
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

Hi ODE fans and perpetrators :)

I've been looking into various ways to optimize the LDLT factorization =
which is one of the computationally intensive tasks in ODE. =
Unforetunately I cannot directly test alot of my results due to not =
having a C(++) compiler around. Nonetheless, I've restructured the =
original "fastldlt.c" file which should be functionally equivalent to =
the original but considerably faster. (Russ... the version I send you =
had a bug in it - this version here should be correct.) I hope the =
version I provide is correct, this is difficult to ascertain just by =
inspection though, as the code is heavily interdependent.

Please can somebody that is "in the know" test the replacement file for =
correctness and speed. If it gets the thumbs-up I'd be happy to see it =
integrated into the ODE package. You can find the file at: =
http://www.cs.sun.ac.za/~henri/fastldlt_henri.c

thanks
  Henri

------=_NextPart_000_000A_01C289A8.C02B4F10
Content-Type: text/html;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=3DContent-Type content=3D"text/html; =
charset=3Diso-8859-1">
<META content=3D"MSHTML 6.00.2716.2200" name=3DGENERATOR>
<STYLE></STYLE>
</HEAD>
<BODY bgColor=3D#ffffff>
<DIV><FONT face=3DArial size=3D2></FONT>
<DIV><FONT face=3DArial>Hi ODE fans and =
perpetrators&nbsp;:)</FONT></DIV>
<DIV><FONT face=3DArial></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial>I've been looking into various ways to optimize =
the LDLT=20
factorization which is one of the computationally intensive =
tasks&nbsp;in ODE.=20
Unforetunately I cannot directly test alot of my results due to not =
having a=20
C(++) compiler around. Nonetheless, I've restructured the original =
"fastldlt.c"=20
file&nbsp;which should be functionally equivalent to the original but=20
considerably faster. (Russ... the version I send you had a bug in it - =
this=20
version here should be correct.) I hope the version I provide is =
correct, this=20
is difficult to ascertain just by inspection though, as the code is =
heavily=20
interdependent.</FONT></DIV>
<DIV><FONT face=3DArial></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial>Please can somebody that is "in the know" test =
the=20
replacement file for correctness and speed. If it gets the thumbs-up I'd =
be=20
happy to see it integrated into the ODE package. You can find the file =
at: <A=20
href=3D"http://www.cs.sun.ac.za/~henri/fastldlt_henri.c"><FONT=20
face=3D"Courier =
New">http://www.cs.sun.ac.za/~henri/fastldlt_henri.c</FONT></A></FONT></D=
IV>
<DIV><FONT face=3DArial></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial>thanks</FONT></DIV>
<DIV><FONT face=3DArial>&nbsp; Henri</FONT></DIV></DIV></BODY></HTML>

------=_NextPart_000_000A_01C289A8.C02B4F10--