From: Norman Geist (norman.geist_at_uni-greifswald.de)
Date: Thu Aug 21 2014 - 08:43:05 CDT
OK, forget what I said. On two Tesla C2050 GPUs along with 12 Xeon cores I
get ~0.034 s/step, so your timings really do look reasonable, although
unfortunately the scaling from 2 to 10 GPUs is only ~50%, which should be due
to the small system size.
;)
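
For reference, a minimal sketch of how speedup and parallel efficiency can be
worked out from s/step timings. It uses the 2-GPU and 8-GPU K20m figures
quoted below, since the 10-GPU timing is not repeated in this message. The
function names are only illustrative and not part of NAMD.

```python
def speedup(t_base, t_new):
    """Speedup of the new run relative to the baseline (times in s/step)."""
    return t_base / t_new


def parallel_efficiency(t_base, n_base, t_new, n_new):
    """Measured speedup divided by the ideal speedup n_new / n_base."""
    return speedup(t_base, t_new) / (n_new / n_base)


# Timings quoted in this thread: ApoA1, K20m GPUs, 20 CPU cores.
t_2gpu = 0.021   # s/step with 2 GPUs
t_8gpu = 0.0075  # s/step with 8 GPUs

s = speedup(t_2gpu, t_8gpu)
e = parallel_efficiency(t_2gpu, 2, t_8gpu, 8)

print(f"speedup 2 -> 8 GPUs: {s:.2f}x")
print(f"parallel efficiency: {e:.0%}")
```

With these numbers the speedup is 2.8x against an ideal 4x, i.e. roughly 70%
efficiency going from 2 to 8 GPUs.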
> -----Original Message-----
> From: owner-namd-l_at_ks.uiuc.edu [mailto:owner-namd-l_at_ks.uiuc.edu] On
> Behalf Of Maxime Boissonneault
> Sent: Thursday, 21 August 2014 14:32
> To: Norman Geist
> Cc: Namd Mailing List
> Subject: Re: AW: AW: namd-l: "Beefier" benchmark
>
>
> > Which combination brings improvement depends on system size. The
> > smaller the system, the better the benefit.
> > Keep to the order x, y, z until the benefit is gone. This needs to be
> > tested for different system sizes.
> >
> > Also check if you still get speedup while increasing the number of
> > GPUs. You might already scale out much earlier, as this timing still
> > doesn't represent 10 GPUs IMHO; I guess 2 would do the same.
> What GPUs is your experience based on? With 2 K20m GPUs, I got
> 0.021 s/step at the fastest when using 20 CPU cores.
> With 8 K20m GPUs and 20 cores, I got 0.0075 s/step, about 3 times
> faster.
> >
> > What value do you use for fullelectfrequency?
> I use the default ApoA1 benchmark, downloaded from the NAMD website,
> except that I added the twoawayx parameter you mentioned.
>
> Maxime
This archive was generated by hypermail 2.1.6 : Wed Dec 31 2014 - 23:22:44 CST