TweetFollow Us on Twitter

FORTRAN Accuracy
Volume Number:8
Issue Number:5
Column Tag:Jörg's Folder

Accuracy & Speed in FORTRAN Compilers

MacFortran II compared with Language Systems Fortran on the Quadra.

By Jörg Langowski, MacTutor Regular Contributing Author

Note: Source code files accompanying article are located on MacTech CD-ROM or source code disks.

Absoft has recently announced MacFORTRAN II version 3.1, which is supposed to run very fast on Quadras. Since I reviewed Language Systems Fortran not long ago, I was curious how big the speed difference between these two compilers really was. So again, I went back to my benchmark programs. Two of them are (in)famous: the Whetstone benchmark, which tests primarily integer arithmetic and depends a lot on the efficiency of subroutine calling, because it works by calling lots of small routines repeatedly from a loop; and the Linpack, which tests the efficiency of floating point matrix operations using a package of linear algebra routines which are quite useful by themselves.

Furthermore, I found a third useful program between the demos included on the LS Fortran diskettes: it is called Paranoia (mentioned already in V8#1), and checks the consistency of the floating point arithmetic. Things are checked like whether 3*4 equals 4*3, 1+0 =1, 1*0 = 0 etc. These may seem trivial to you, but as you will see, they aren’t when you are dealing with Fortran compilers.

Numerical accuracy tests

I’d like to describe the Paranoia benchmark first, and its results on the two Fortran compilers. The program was original written in Basic and then ported to Fortran; the person who was responsible for the version I used is Richard Karpinski at the Computer Center of the University of California (San Francisco, CA 94143-0704). His description of the program follows:

===== PARANOIA =====

A test program that evaluates the quality of a numerics environment. Note that numerics quality depends on the compiler used as well as the underlying system hardware and software.

Paranoia is a rather large program, devised by Prof. Kahan of Berkeley, to explore the floating point system on your computer. These files are being redistributed as encouraged in the note copied below.

Paranoia.f single precision Fortran, 29 Jan. 1986

Dear Fellow Paranoid,

We ask:

1. Please distribute these programs as widely as you like. Be sure to include as much help as possible. Please include these requests as well.

2. Please let me know if you can provide it on other media. Other potential users may be saved from retyping 2500 lines of code if you can provide the programs on a Whizbang 200 disk/tape/notched-stick. It is reasonable to ask money for this.

3. PLEASE let me know about any source changes that you made to make Paranoia work on your system. The exact model/version/date of your system is quite important in order to understand the changes. If you send the new version, please indicate where the changes occur. Machine readable is preferred.

4. Please send me your results. Note which version of Paranoia gave the results and what machine/language model/version etc. they apply to.

5. Suggestions and comments are always welcome.

Thank you for participating in this unique investigation of contemporary floating-point arithmetic. Your help is vital to the project.

Yours truly,

Richard Karpinski

Modula Assured Quality Software, 6521 Raymond Street

Oakland, CA 94609

ps: Send a stamped, self addressed envelope to the above address to get a sheet on the current status of the project at any time.

The program is too long to print here (104 K, enclosed in compressed form on the source code disk). Just to give you a feeling for what it does, here are some typical lines of code:

C... LOOK FOR SOME OBVIOUS MISTAKES
 IF (0.0E0+0.0E0 .EQ. 0.0E0 .AND.
     1     1.0E0-1.0E0 .EQ. 0.0E0 .AND.
     1     1.0E0       .GT. 0.0E0 .AND.
     1     1.0E0+1.0E0 .EQ. 2.0E0) GOTO 930
 FAILS=FAILS+1
 WRITE(OUT,920)
920FORMAT(‘ FAILURE: violation of  0+0=0  or  1-1=0  or  
     1  1>0  or 1+1 = 2.’)

You wouldn’t believe that these expressions could be evaluated the wrong way, but there must be a reason why these tests are done

Less obvious are some other mistakes that are tested by Paranoia. For instance, the basic arithmetic operations should carry some ‘guard digits’, that is, some more digits that are the representation of the floating point number in memory. Also, they should be correctly rounded in a way that doesn’t introduce inconsistencies. It is not trivial to design floating point arithmetic in such a way that roundoff errors do not affect the consistency of the result. This is seen when the Paranoia benchmark is compiled with the Language Systems (v3.0) and Absoft (v3.1.2) compilers, using various settings of the optimizations and other options.

First, the code produced by the Language Systems compiler, at any optimization level, does not produce any error messages at all, so the arithmetic seems to behave in a consistent way. The rounding conforms to the IEEE p754 standard.

Compiling the benchmark with Absoft MacFortran 3.1.2, with and without the basic optimizations, produces a lot of error messages, like these:

(1-u1)-1/2 < 1/2 is false, so this program may malfunction.

Multiplication lacks a guard digit violating 1*x = x .

Division lacks a guard digit violating x/1 = x .

Computed value of 1/1.00...001 is not less than 1 .

The errors are typical of floating point implementations where roundoff is not handled correctly; it seems to be mainly due to the fact that Absoft tries to keep intermediate values in registers as long as possible. These errors are not as serious as they look, since they are mainly deviations of one unit in the last significant digit of the single-precision numbers used in the test. Some of the error messages are actually a consequence of the fact that the program computes the machine precision at the beginning, from the minimum distance of two floating point numbers close to 1.0. The Absoft-optimized code without any further options comes up with a much too small value for this distance, and therefore some later tests, where this machine precision is used, report errors as well.

All these considerations may seem quite academic to you, because we’re really dealing with very small differences between the computed and the ‘true’ result, of the order of 10-8 or so. But problems where the precise computation of a small difference between two large numbers is important are not that rare, and in those cases such errors will quickly blow up. It is for this reason that the IEEE standard for floating point arithmetic has been developed, and that Apple has implemented the SANE package which follows the IEEE proposal.

There is a remedy, however: one can force the Absoft compiler (with the -e option) to use extended precision in all subexpression calculations, and store values to memory from the registers and re-load them after every assignment. That seems to make the Paranoia benchmark work correctly, and - almost - no more errors are reported. Only now, there are some numbers x and y where x*y is not equal to y*x! (See listing - I reprinted the part of the Paranoia benchmark that reported the error). The only way around this error, and the way to make the benchmark run in a fully consistent fashion, is to compile the code non-optimized and with the -e option. In this case, however, Absoft MacFortran loses all its speed advantage over Language Systems Fortran.

Speed tests with Whetstone and Linpack

To compare the speed figures for the Whetstone and Linpack benchmarks, we should therefore use the Absoft compiler at least with the -e option, and maybe also drop the optimizations. Since now both LSF and ABF compilers exist in versions that support 68040 code, I’ve compared the benchmarks both on a Mac IIx and on a Quadra 700. LSF was always run at the highest optimization level, since the arithmetic seems to be OK at that level; for Absoft, I used the basic optimizations (-O) with and without the extended precision option (-e).

Linpack (single precision)

LSF 030/882 code (Mac IIx) 0.12 MFlops

(Quadra 700) 1.32 MFlops

LSF 040 code (Quadra 700) 1.37 MFlops

ABF 030/882 code -O (Quadra 700) 1.30 MFlops

ABF 030/882 code -O -e (Quadra 700) 1.27 MFlops

ABF 040 code -O (Quadra 700) 1.60 MFlops

ABF 040 code -O -e (Quadra 700) 1.58 MFlops

Whetstone (single precision)

LSF 030/882 code (Mac IIx) 974 K

(Quadra 700) 3976 K

LSF 040 code (Quadra 700) 3984 K

ABF 030/882 code -O (Mac IIx) 1215 K

ABF 030/882 code -O -e (Mac IIx) 955 K

ABF 030/882 code -O (Quadra 700) 4051 K

ABF 030/882 code -O -e (Quadra 700) 3829 K

ABF 040 code -O (Quadra 700) 5864 K

ABF 040 code -O -e (Quadra 700) 5381 K

The essence of these figures is that there is no significant execution speed difference between Absoft and Language Systems Fortran on a Mac IIx, if you make sure that both generate arithmetically consistent code. On the Quadra, however, the fact shows up that Language Systems really creates almost the same code for 68030 and 68040 systems; no additional speed is gained using the -68040 option, while Absoft gains 24% on the Linpack benchmark and 40% on the Whetstone benchmark when 68040 code generation is switched on. All in all, Absoft code on the Quadra is 15% faster than LSF code for the Linpack and 35% for the Whetstone benchmark.

If you are running Fortran on anything other than a Quadra, LSF is really the only choice, given its excellent support of the Macintosh user interface, System 7 features such as AppleEvents, 100% VAX compatibility going into such details as the syntax of I/O statements, and its documentation. For pure numeric performance on a Quadra, Absoft still features faster execution. Still, I’d like to see Absoft’s ‘threaded math library’ for the 68040 incorporated into LSF; that would be my dream system.

Forth news

For the Forth readers of this column, I recently got news from the creators of two of the best public domain Forth development systems for the Macintosh. You read in MacTutor, Vol. 8, No. 4, August 1992 issue about the update of Mops (2.1), the object-oriented Forth implemented by Michael Hore. He now sent me the tutorial that he is distributing together with Mops, a 340K Microsoft Word file.

I’m not going to print the Mops Tutorial here, obviously. But if you need it and don’t find it on any of the obvious sources (ftp from oddjob.uchicago.edu or sumex-aim.stanford.edu), drop me some bytes at langowski@frembl.bitnet and I’ll be happy to mail the file to you.

Another message came from Chris Heilman, who created Pocket Forth:

From: N%”heilman@PC.bitnet”

To: angowski@FREMBL51.bitnet

CC:

Subj: Apple Events in Pocket Forth

Joerg:

Hello again. How have things been going for you? I read your latest article in MacTutor’s June (congrats by the way on getting it back together) about Apple Events in LS fortran. I’ve been working along exactly those lines.

Just two weeks ago I completed and released Pocket Forth 6. While release 5 was capable of handling high-level events, PF6 makes it easy and fun. Two new words, AE: and ;AE are used to define event handlers. Another word, ,S takes a 4 character token from the input stream and puts a 32 bit number on the stack. Use them like this:

  ,s misc ,s dosc AE: blah, blah blah, .... ;AE

This installs the dosc event handler into a list. The dictionary is then saved. The next time the program starts, it handles dosc events. Of course the four required events are handled automatically, but their actions can be changed at any time.

But wait, that’s not all! I’ve added SANE floating point for any numeric token with an ‘E’ or decimal point, new icons, drag&drop file loading and a rewritten manual in TeachText format.

I’d like to send you a copy, so expect a disk soon. I’d email a copy to you, but our system has been choking on loooong mail lately, and I’ve gotten real frustrated sending 200K files 3 or 4 times at 2400 bps.

Later, Chris Heilman

I’m looking forward to seeing the new version of Pocket Forth. Expect some lines in this column when I’ve reviewed it.

Example: check if multiplication commutes

C
Cvery short main program
C
 CALL COMMUT(20)
 PAUSE
 END
C
C
 SUBROUTINE COMMUT( NUMTRY)
C
 Implicit None
 
 INTEGER  DEFECT
 REAL   ULPPLS, ULPMIN

 REAL FP0, FP1, FP2, FP3, HALF
C
 INTEGER NUMTRY
 REAL R9, X, X9, Y, Y9, Z, Z9
 INTEGER I, NN
C

 FP0 = 0
 FP1 = 1
 FP2 = FP1+FP1
 FP3 = FP2+FP1
 HALF = FP1 / FP2
 ULPMIN = 5.96046440E-08
 ULPPLS = 2.0*ULPMIN
 
2920  WRITE(*,2921) NUMTRY
2921  FORMAT(/’ Does multiplication commute?’,
     1   ‘ Testing if  x*y = y*x  for’, I4,’ random pairs:’)
2930  R9 =  SQRT(FP3)
 I = NUMTRY + 1
 X9 = FP0 / FP3
2960  CALL RANDOM (X, Y, X9, R9)
 Y9=X9
 CALL RANDOM (X, Y, X9, R9)
 Z=X9*Y9
 Y=Y9*X9
 Z9=Z-Y
 I=I-1
 IF (I .GT. 0 .AND. Z9 .EQ. FP0) GOTO 2960
2970  IF (I .GT. 0) GOTO 3000
2980  X9=FP0+HALF/FP3
 Y9=(ULPPLS+ULPMIN)+FP0
 Z=X9*Y9
 Y=Y9*X9
 Z9=(FP0+HALF/FP3)*((ULPPLS+ULPMIN)+FP0)
     1 -((ULPPLS+ULPMIN)+FP0)*(FP0+HALF/FP3)
 IF (Z9 .NE. FP0) GOTO 3000
 WRITE(*,2990) NUMTRY
2990  FORMAT(‘ No failure found in ‘,I4,’ randomly chosen pairs.’)
 RETURN
3000  DEFECT=DEFECT+1
 WRITE(*, 3001) X9, Y9
 WRITE(*, 3002) Z, Y, Z9
 NN=NUMTRY-I+1
 WRITE(*, 3003) NN
3001  FORMAT(‘ DEFECT:  x*y = y*x  violated at  x = ‘,E15.7,’, y = ‘,
     1 E15.7)
3002  FORMAT(‘  x*y =’,E15.7,’,  y*x =’,E15.7,’,  x*y-y*x =’,E15.7)
3003  FORMAT(‘    ... pair no.’, I4)
 RETURN
 END
C--
 SUBROUTINE RANDOM (X, Y, X9, R9)
 REAL X, Y, X9, R9
2950  X=X9+R9
 Y=X*X
 Y=Y*Y
 X=X*Y
 Y=X-AINT(X)
 X9=Y+X*.000005
 RETURN
 END

 
AAPL
$423.00
Apple Inc.
-8.77
MSFT
$34.59
Microsoft Corpora
-0.39
GOOG
$900.68
Google Inc.
+0.06

MacTech Search:
Community Search:

Software Updates via MacUpdate

Apple Java 2013-004 - For OS X 10.7 and...
Apple Java for OS X 2013-004 supersedes all previous versions of Java for OS X. This release updates the Apple-provided system Java SE 6 to version 1.6.0_51 and is for OS X versions 10.7 or later.... Read more
Google Chrome 27.0.1453.116 - Modern and...
Google Chrome is a Web browser by Google, created to be a modern platform for Web pages and applications. It utilizes very fast loading of Web pages and has a V8 engine, which is a custom built... Read more
EarthDesk 6.2 - Striking animated image...
EarthDesk replaces your static desktop picture with a rendered image of Earth showing correct sun, moon and city illumination. With an Internet connection, EarthDesk displays near real-time global... Read more
Apple Configurator 1.3 - Configure and d...
Apple Configurator makes it easy for anyone to mass configure and deploy iPhone, iPad, and iPod touch in a school, business, or institution. Three simple workflows let you prepare new iOS devices... Read more
Apple Java for Mac OS X 10.6 Update 16 -...
Apple Java for Mac OS X 10.6 Update 16 delivers improved security, reliability, and compatibility by updating Java SE 6 to 1.6.0_51.Version Update 16: See http://support.apple.com/kb/HT5744 for more... Read more
Neat 4.0.3 - Digital filing system for r...
Neat (formerly NeatWorks) is a powerful scanning and digital filing system that enables you to scan and organize receipts, business cards, and documents. Unlike other scanning software, NeatWorks... Read more
Adobe Muse CC 5.0 - Design and publish H...
Adobe Muse enables designers to create websites as easily as creating a layout for print. Design and publish original HTML pages using the latest Web standards, and without writing code. Now in beta... Read more
Adobe Creative Cloud 1.0 - Everything ne...
Adobe Creative Cloud costs $49.99/month (or less if you're a previous Creative Suite customer). Creative Suite 6 is still available for purchase (without a monthly plan) if you prefer. Introducing... Read more
Adobe Flash Professional CC 13.0.0.759 -...
Flash Professional CC is available as part of Adobe Creative Cloud for as little as $19.99/month (or $9.99/month if you're a previous Flash Professional customer). Flash Professional CS6 is still... Read more
Adobe InCopy CC 9.0 - Create streamlined...
InCopy CC is available as part of Adobe Creative Cloud for as little as $19.99/month (or $9.99/month if you're a previous InCopy customer). InCopy CS6 is still available for purchase (without a... Read more

Latest Forum Discussions

See All

Calendars+ by Readdle Goes Free For A Ve...
Calendars+ by Readdle Goes Free For A Very Limited Time Posted by Andrew Stevens on June 19th, 2013 [ permalink ] Universal App - Designed for iPhone and iPad | Read more »
Modern Combat 4: Zero Hour Has A Meltdow...
Modern Combat 4: Zero Hour Has A Meltdown, Gets New Maps, Multiplayer Modes, and More Posted by Andrew Stevens on June 19th, 2013 [ permalink ] | Read more »
XCOM: Enemy Unknown – Commander’s Log: H...
Part of the series 148Apps Goes Deep on XCOM: Enemy Unknown I’m still haunted by visions of a parallel world (classified as Xbox 360) as it wasn’t long ago that I was in charge of the XCOM project and led a squadron of soldiers against an alien... | Read more »
Rovio Stars: The Angry Birds’ New Publis...
Rovio Entertainment, creators of Angry Birds, has a new publishing initiative called Rovio Stars that will see its first titles Icebreaker and Tiny Thief released soon. Kalle Kaivola, Senior Vice President of Product & Publishing at Rovio... | Read more »
Favorite Four: Soccer Games
As a soccer fan, I’m getting twitchy. The Confederations Cup might be helping a little, but I miss the English Premier League week in, week out. This is where I sink time into FIFA 13 on my console in order to counteract the problem. What about... | Read more »
Knights of Pen & Paper Adds More Dun...
Knights of Pen & Paper Adds More Dungeons and Loot In Free Update Posted by Andrew Stevens on June 19th, 2013 [ permalink ] | Read more »
Froot ‘n’ Nutz Review
Froot ‘n’ Nutz Review By Blake Grundman on June 19th, 2013 Our Rating: :: VISUALLY DICEYUniversal App - Designed for iPhone and iPad While Froot ‘n’ Nutz may not look very modern, it is very likable.   | Read more »
148Apps Goes Deep on XCOM: Enemy Unknown
XCOM: Enemy Unknown will be released tonight for iPad and iPhone. And we’re very excited. While XCOM isn’t the first console game to be ported over to iOS, it is one of the most ambitious. XCOM: Enemy Unknown while first released for XBox 360 and... | Read more »
A Cautionary Tail – An Interactive Book...
A Cautionary Tail – An Interactive Book That Teaches Self-Acceptance Posted by Andrew Stevens on June 19th, 2013 [ permalink ] | Read more »
XCOM: Enemy Unknown – Cheats, Tips, and...
The X-Com series, particularly the earlier games, are notoriously unforgiving. Although while XCOM: Enemy Unknown has been modernized, and is therefore more player friendly, it’s no slouch either. In fact, even on the Normal difficulty there’s a... | Read more »

Price Scanner via MacPrices.net

Smaller Tablets Forecast To Get Even More Popular...
The DisplaySearch Blog’s Richard Shim notes that tablet PCs with screen sizes smaller than 9 inches are currently forecast to account for 66% of tablet PC shipments for the year but that share is... Read more
Updated iPad Price Trackers
We’ve updated our iPad Price Tracker and our iPad mini Price Tracker with the latest information on prices and availability from Apple and other resellers. Read more
Apple refurbished iPod nanos available for $99
The Apple Store has Apple Certified Refurbished 16GB iPod nanos available for $99 including free shipping and Apple’s standard one-year warranty. That’s $50 off the cost of new nanos. All colors are... Read more
iFixIt Tears Down mid-2013 11.6-inch MacBook Air
iFixIt Chief Information Architect Miroslav Djuric says: The epic week of disassembly continues: Today, the MacBook Air 11″ found its way onto our teardown table and was soon just another Apple in... Read more
Mature Consumers Know When They Need a PC
Tech.Pinions’ Ben Bajarin sensibly observes that one of the fundamental characteristics of a mature market is mature consumers – mature in the sense that they know what they want and more importantly... Read more
Windows 8 Continues Ascension in User Popularity R...
Softpedia’s Bogdan Popa notes that Windows 8 is now the fourth most popular operating system in the world, and according to some new statistics, it continues to gain new users every day. Popa cites... Read more
Apple iOS and OS X Updates Put Bluetooth Smart Rea...
From its Worldwide Developers Conference last week, Apple announced unprecedented integration of Bluetooth technology into its operating systems – a move that sets the bar for Bluetooth integration... Read more
Buy a 13″ MacBook Pro, get AppleCare for as little...
Adorama has 13″ MacBook Pros bundled with 3-year AppleCare Protection Plans for as little as $40 extra (AppleCare has an MSRP of $249 for 13-inch MacBook Pros). Shipping is free, and Adorama charges... Read more
Updated MacBook Price Trackers
We’ve updated our MacBook Price Trackers with the latest information on prices, bundles, and availability on MacBook Airs, MacBook Pros, and the MacBook Pros with Retina Displays from Apple’s... Read more
Save $140 on the 15″ 2.3GHz MacBook Pro
B&H Photo has the 15″ 2.3GHz MacBook Pro on sale for $1659 including free shipping. Their price is $140 off MSRP. B&H will include free copies of Parallels Desktop, Bento Database, and LoJack... Read more

Jobs Board

*Apple* At-Home Team Manager - Apple (U...
Changing the world is all in a day's work at Apple . If you love innovation, here's your chance to make a career of it. You'll work hard. But the job comes with more than Read more
*Apple* Retail - Manager - Apple (Unite...
Job SummaryKeeping an Apple Store thriving requires a diverse set of leadership skills, and as a Manager, youre a master of them all. In the stores fast-paced, dynamic Read more
*Apple* - Solution Architect - CompuCom...
Job Location: US-TX-Dallas Posted Date: 4/18/2013 Overview: The Apple Solution Architect (SA) will be responsible for supporting pre-sales and post-sales solutions in Read more
*Apple* Support Technician; Mid-level -...
A Kforce client in Washington, DC area is seeking an Apple Support Technician. This contractor will have the following types of responsibilities including, but not Read more
Systems Engineer - *Apple* TV - Apple...
Job Summary The Apple TV team is looking for an experienced engineer with a passion for delivering first in class home entertainment solutions. The individual must be Read more
All contents are Copyright 1984-2011 by Xplain Corporation. All rights reserved. Theme designed by Icreon.