TweetFollow Us on Twitter

68881 Access
Volume Number:4
Issue Number:4
Column Tag:Forth Forum

Direct 68881 Floating Point Access

By Jörg Langowski, MacTutor Editorial Board, Grenoble, France

Being the proud owner of a MacII since the beginning of this year, I was - like so many others - disappointed at the relative slowness of the SANE package. Even with the floating point coprocessor, the speedup is only a factor of 5-10 compared to the Mac SE. The coprocessor itself allows you to go much faster than that, and it is Apple’s SANE implementation that slows down the operations.

The reason for this is, of course, that Apple is trying to guarantee the ‘compatibility’ of the SANE on the MacII with the old SANE implementation. That means that the results are supposed to be the same, down to the least significant bit (or even to the last guard bit?). The 68881 uses different algorithms for calculating transcendental functions than SANE does, therefore some of its built-in operations could not be used and had to be replaced by software. Of course, that slows down things a lot.

As a side note, I’m not at all in favor of such a strategy. Artificially restraining a high performance chip just because its results, although accurate enough, don’t match the old - also accurate - results to the last bit seems a little exaggerated. Stable numerical algorithms should take into account the possibility that the hardware changes slightly and the machine errors are different, and they should be immune against such changes; or at least the new SANE should have had an option built in that uses the 68881 directly! [Amen to that statement! -Ed]

Many of the development systems for the Macintosh now come with the possibility to generate code that supports the floating point coprocessor directly, many others don’t; working with Forth, we don’t have a real problem since we can easily redefine our floating point operators. This month I’ll show you how to do that.

The 68881

Let’s first have a look at the floating point coprocessor itself.

The 68881 is accessed from the 68020 in a special address area. When the 68020 encounters an instruction of the form $Fxxx (previously the F line trap), it will set its function code lines (pins FC0-FC2) all to high, indicating ‘CPU space’. It will then exchange information with the coprocessor’s internal registers to perform the floating point operation requested.

This information exchange occurs automatically, it is part of the 68020’s design; when using the floating point instructions, you don’t notice the communication between the two processors at all. The 68881 appears as an extension to the 68020, just as if we had a new set of registers available, with special instructions operating on them.

The registers FP0-FP7 can each hold one extended precision (80-bit) floating point number, in the format given by the IEEE standard: bits 0-63 contain the mantissa, 64-78 the 15-bit exponent offset by 16383, and bit 79 the sign. The instruction set of the 68881 allows you to do floating point operations on any or between any two of these registers, and transfer data between them, the 68020’s registers and memory.

Mach2’s assembler fully supports all 68881 instructions. The redefinition of the words in the SANE vocabulary is therefore quite straightforward. Register D7 is used by Mach2 as the floating point stack pointer, all we have to do is to transfer floating point numbers from the FP stack to the 68881’s registers, do the FP operation and transfer back the result.

The 68881 expects floating point numbers in a different format than SANE does. In order to keep FP numbers aligned to the boundaries of a 32-bit long word, which speeds up operations, an extended precision number in memory will be 96 bits long instead of 80; there is a 16 bit gap between the 64-bit mantissa and the 16-bit sign and exponent field. Since SANE and the floating point stack in Mach2 use 80 bit numbers, we have to convert their format when we use the coprocessor.

Listing 1 provides the macros f>2, f>1 and 1>f for this purpose. The general format for a binary floating point operation is then

 f>2  
 fop.x  FPn, FPm 
 1>f

where fop.x is an extended format floating point instruction that operates on floating point registers FPn and FPm. The example always uses registers FP0 and FP1. Unary operations are encoded the same way, using only FP0.

In the example, I provided a new vocabulary f68881 that contains redefinitions of the most important SANE operations for the 68881. Of course, the concept can be far extended. We have eight registers at our disposition and can use them to optimize more complicated numerical algorithms, using only assembly language. I might give some examples how to do this in a later column.

For the moment, let’s content ourselves with the speed improvement that we have achieved so far, which is already remarkable. Some simple benchmarks are listed at the end of listing 1, and the results are given here:

( MacII, direct 68881 access, 100000 loops each )

bmark1 2 Secs 18 Ticks ok <0> [0]

bmark2 4 Secs 48 Ticks ok <0> [0]

bmark3 4 Secs 14 Ticks ok <0> [0]

bmark4 4 Secs 17 Ticks ok <0> [0]

bmark5 4 Secs 14 Ticks ok <0> [0]

( MacII, SANE w/68881, 10000 loops each )

smark1 15 Ticks ok <0> [0]

smark2 37 Secs 12 Ticks ok <0> [0]

smark3 24 Ticks ok <0> [0]

smark4 1 Secs 33 Ticks ok <0> [0]

smark5 1 Secs 33 Ticks ok <0> [0]

( Mac +, SANE, 10000 loops each )

ok <0> [0]

smark1 24 Ticks ok <0> [0]

smark2 181 Secs 54 Ticks ok <0> [0]

smark3 49 Ticks ok <0> [0]

smark4 4 Secs 47 Ticks ok <0> [0]

smark5 5 Secs 14 Ticks ok <0> [0]

For the Mac+, the fnull1 and fnull2 operations had been replaced by simple fdrops. As you see, the speedup going from Mac+ to MacII’s SANE is not so breathtaking: a factor of 6 for the exponential, 4 for addition and subtraction; but when we access the 68881 directly, we gain another factor of 3 for simple addition and multiplication and 78 for the exponential. It is in the calculation of the transcendentals where the 68881 really shines.

Pop up menus

Someone approached me lately on the question of how to do pop up menu selection. Since the technical notes contain only sketchy references to popup menus at the time I write this (e.g. TN156, TN172), I’d like to give you a practical example how to use pop up menus from Mach2 in a simple way.

Listing 2 explains the process. The PopUpMenuSelect trap takes four parameters:

- a handle to the menu to be displayed (32 bits),

- the top and left global coordinate of the point at which to display the menu (2*16 bits),

- the menu item which should be positioned at that point for the default selection.

Although Mach2 knows the trap name, the interface to this routine is not (yet) correct, so we have to redefine it in assembler. Note that the point returned by the @mouse function is in local coordinates, while PopUpMenuSelect expects it in global coordinates.

The example defines a menu using the Mach2 interface; the menu is created with -1 as the insertion parameter (-1 150 mymenu BOUNDS) so that after insertion into the tasks’s menu bar the menu will stay invisible (just as we did for the hierarchical menus). Note that a pop up menu has to be inserted into the menu bar before using it.

The content handler for the default Mach2 window is then rewritten so that on a mouse down event it will select the example pop up menu. The menu handler will just beep a number of times depending on the item selected. dopop activates the new content handler while nopop deactivates it.

Feedback dept.

This letter comes from Vassili Dzuba, Paris:

“In January’s issue ‘Mousehole Report’, Alan Dall put in his wish list the ability to define ‘ghost copies’ of applications. Even without Unix’ capability of defining links, this can be done with a small program using the _Launch trap. This program takes only 1K on the disk. The path name of the application to launch is stored in ‘STR ‘ resource 1000. It’s possible to set the creator and the bundle bit of the ghost to have it share the same icon as the real thing. Of course, double-clicking on a document can then launch the ghost instead of the application, but the slowing down is only marginal.

The program is the following

(using MPW’s assembler):

 INCLUDE‘traps.a’
ghost MAIN
 MOVE.W #0,-(SP) 
 ; the context data (_launch parameter)
 MOVE.L #’STR ‘,-(SP)
 ; 1st parameter of GetResource
 MOVE.W #1000,-(SP)
 ; 2nd parameter of GetResource
 _GetResource
 ; handle to string in (SP)
 TST.L  (SP)
 ; test if null handle (no resource available)
 BEQ.B  exit
 ; if null, go to exit
 MOVE.L (SP),A0
 MOVE.L (A0),(SP)
 ; handle dereferenced
 MOVE.L SP,A0
 ; stack pointer in A0
 _Launch
exit  _ExitToShell
 END

The resource file is something like this (using Rez format):

#include “types.r”
resource ‘STR ‘ (1000) {
 “Sys:myDirectory:MyApp”
};

A ghost can be easily created using a small shell script (assuming the original ghost’s directory being {MPW}dev) which sets up the string resource:

duplicate {MPW}’dev:ghost’ {2}
echo ‘#include “types.r”’n
‘resource ‘’’STR ‘’’ (1000) { “‘{1}’” };’
| rez -a -o {2}

Assuming this script is named ‘summon’, the creation in the current directory of a ghost of MacPaint would be something like:

summon ‘sys:appli ƒ:mac paint’ ‘macpaint.Ghost’

Sincerely yours”

Thank you, Vassili, for this helpful little utility.


Listing 1: direct access 68881 floating point words for Mach2
\ 68881 access, © J. Langowski/MacTutor Jan 1988

only forth also assembler also sane
vocabulary f68881
also f68881 definitions

code f>2
 add.l  #20,d7
 move.l d7,a0
 move.l -(a0),-(a7); move mantissa
 move.l -(a0),-(a7); in two 32-bit chunks
 subq.l #2,a7    ; 16-bit gap
 move.w -(a0),-(a7); move exponent + sign
 fmove.x(a7)+,fp0; transfer from stack to fp0
 move.l -(a0),-(a7); same for fp1...
 move.l -(a0),-(a7)
 subq.l #2,a7
 move.w -(a0),-(a7)
 fmove.x(a7)+,fp1
 add.l  #10,a0 ; a0 points to 2nd fp stack item
 rts
end-code mach

code f>1
 add.l  #10,d7
 move.l d7,a0
 move.l -(a0),-(a7)
 move.l -(a0),-(a7)
 subq.l #2,a7
 move.w -(a0),-(a7)
 fmove.x(a7)+,fp0
 rts
end-code mach

code 1>f
 fmove.xfp0,-(a7)
 move.w (a7)+,(a0)+; transfer exponent + sign
 addq.l #2,a7    ; skip16 bit gap
 move.l (a7)+,(a0)+; transfer mantissa 
 move.l (a7)+,(a0)+; in 2 steps
 sub.l  #10,d7   ; adjust FP stack pointer
 rts
end-code mach

( note: f>1 or f>2 and 1>f should always occur in pairs since 1>f expects 
A0 to point to second floating point stack position; this is assured 
by f>1 and f>2 )

code f+
 f>2
 fadd.x fp1,fp0
 1>f
 rts
end-code

code f-
 f>2
 fsub.x fp1,fp0
 1>f
 rts
end-code

code f/
 f>2
 fdiv.x fp1,fp0
 1>f
 rts
end-code

code f*
 f>2
 fmul.x fp1,fp0
 1>f
 rts
end-code

code fmod
 f>2
 fmod.x fp1,fp0
 1>f
 rts
end-code

code frem
 f>2
 frem.x fp1,fp0
 1>f
 rts
end-code

code fabs
 f>1
 fabs.x fp0
 1>f
 rts
end-code

code facos
 f>1
 facos.xfp0
 1>f
 rts
end-code

code fasin
 f>1
 fasin.xfp0
 1>f
 rts
end-code

code fatan
 f>1
 fatan.xfp0
 1>f
 rts
end-code

code fatanh
 f>1
 fatanh.x fp0
 1>f
 rts
end-code

code fcos
 f>1
 fcos.x fp0
 1>f
 rts
end-code

code fcosh
 f>1
 fcosh.xfp0
 1>f
 rts
end-code

code fe^x
 f>1
 fetox.xfp0
 1>f
 rts
end-code
 
code fe^x-1
 f>1
 fetoxm1.x fp0
 1>f
 rts
end-code

code fgetexp
 f>1
 fgetexp.x fp0
 1>f
 rts
end-code

code fgetman
 f>1
 fgetexp.x fp0
 1>f
 rts
end-code

code fint
 f>1
 fint.x fp0
 1>f
 rts
end-code

code fintrz
 f>1
 fintrz.x fp0
 1>f
 rts
end-code

code fln
 f>1
 flogn.x  fp0
 1>f
 rts
end-code

code fln+1
 f>1
 flognp1.x fp0
 1>f
 rts
end-code

code flog10
 f>1
 flog10.x fp0
 1>f
 rts
end-code

code flog2
 f>1
 flog2.x  fp0
 1>f
 rts
end-code

code fneg
 f>1
 fneg.x fp0
 1>f
 rts
end-code

code fsin
 f>1
 fsin.x fp0
 1>f
 rts
end-code

code fsinh
 f>1
 fsinh.x  fp0
 1>f
 rts
end-code

code fsqrt
 f>1
 fsqrt.x  fp0
 1>f
 rts
end-code

code ftan
 f>1
 ftan.x fp0
 1>f
 rts
end-code

code ftanh
 f>1
 ftanh.x  fp0
 1>f
 rts
end-code

code f10^x
 f>1
 ftentox.x fp0
 1>f
 rts
end-code

code f2^x
 f>1
 ftwotox.x fp0
 1>f
 rts
end-code

fp

also forth definitions
code fnull1
 f>1
 1>f
 rts
end-code

code fnull2
 f>2
 1>f
 rts
end-code

: bmark1 counter 1.0 
 100000 0 do fdup fnull1 fdrop loop 
timer fdrop ;

: bmark2 counter 4.3352 
 100000 0 do fdup fe^x fdrop loop 
timer fdrop ;

: bmark3 counter 
 3.5 4.5 100000 0 do fover fover fnull2 fdrop loop 
timer fdrop fdrop ;

: bmark4 counter 
 3.5 4.5 100000 0 do fover fover f+ fdrop loop 
timer fdrop fdrop ;

: bmark5 counter 
 3.5 4.5 100000 0 do fover fover f* fdrop loop 
timer fdrop fdrop ;

also sane

: smark1 counter 
 1.0 10000 0 do fdup fnull1 fdrop loop 
timer fdrop ;

: smark2 counter 
 4.3352 10000 0 do fdup fe^x fdrop loop 
timer fdrop ;

: smark3 counter 
 3.5 4.5 10000 0 do fover fover fnull2 fdrop loop 
timer fdrop fdrop ;

: smark4 counter 
 3.5 4.5 10000 0 do fover fover f+ fdrop loop 
timer fdrop fdrop ;

: smark5 counter 
 3.5 4.5 10000 0 do fover fover f* fdrop loop 
timer fdrop fdrop ;
Listing 2: Pop up menus
\ popup menu interface from Forth
\ © J. Langowski/MacTutor Jan 1988

\ FUNCTION PopUpMenuSelect 
\ (menu: MenuHandle; top, left, popUpItem: INTEGER):
\LONGINT;
\  INLINE $A80B;

code popup ( hMenu top left item# | menuID item  -- )
 exg  d4,a7
 clr.l  -(a7)
 move.l 12(a6),-(a7) \ handle
 move.w 10(a6),-(a7) \ top
 move.w 6(a6),-(a7)\ left
 move.w 2(a6),-(a7)\ item
 adda.l #16,A6
 _popupmenuselect
 clr.l  d0
 clr.l  d1
 move.w (a7)+,d0
 move.w (a7)+,d1
 exg  d4,a7
 move.l d0,-(a6)
 move.l d1,-(a6)
 rts
end-code

CODE  @MOUSE
    SUBQ.L    #4,A6
    MOVE.L    A6,-(A7)
    _GETMOUSE
    RTS
END-CODE

CODE unpack
 MOVE.L (A6),D0
 CLR.L  D1
 MOVE.W D0,D1
 CLR.W  D0
 SWAP.W D0
 MOVE.L D0,(A6)
 MOVE.L D1,-(A6)
 RTS
END-CODE MACH

108 user taskmenubar
152 user content-hook

NEW.MENU myMenu
“ TestMenu” myMenu TITLE
-1 150 myMenu BOUNDS \ invisible menu
“ Item 1;Item 2;Item 3” myMenu ITEMS

taskmenubar @ mymenu add
call drawmenubar

: beep 
 ?dup if 
 0 do 5 call sysbeep loop 
 then
;

: content-handler { | pt -- }
 @mouse -> pt
 ^ pt call localtoglobal
 mymenu @
 pt unpack
 1
 popup
 beep drop
 run-content
;

: dopop [‘] content-handler content-hook ! ;
: nopop [‘] run-content content-hook ! ;
 
AAPL
$100.96
Apple Inc.
-0.83
MSFT
$47.52
Microsoft Corpora
+0.84
GOOG
$596.08
Google Inc.
+6.81

MacTech Search:
Community Search:

Software Updates via MacUpdate

Airfoil 4.8.9 - Send audio from any app...
Airfoil allows you to send any audio to AirPort Express units, Apple TVs, and even other Macs and PCs, all in sync! It's your audio - everywhere. With Airfoil you can take audio from any... Read more
WhatRoute 1.13.0 - Geographically trace...
WhatRoute is designed to find the names of all the routers an IP packet passes through on its way from your Mac to a destination host. It also measures the round-trip time from your Mac to the... Read more
Chromium 37.0.2062.122 - Fast and stable...
Chromium is an open-source browser project that aims to build a safer, faster, and more stable way for all Internet users to experience the web. FreeSMUG-Free OpenSource Mac User Group build is... Read more
Attachment Tamer 3.1.14b9 - Take control...
Attachment Tamer gives you control over attachment handling in Apple Mail. It fixes the most annoying Apple Mail flaws, ensures compatibility with other email software, and allows you to set up how... Read more
Duplicate Annihilator 5.0 - Find and del...
Duplicate Annihilator takes on the time-consuming task of comparing the images in your iPhoto library using effective algorithms to make sure that no duplicate escapes. Duplicate Annihilator detects... Read more
jAlbum Pro 12.2 - Organize your digital...
jAlbum Pro has all the features you love in jAlbum, but comes with a commercial license. With jAlbum, you can create gorgeous custom photo galleries for the Web without writing a line of code!... Read more
jAlbum 12.2 - Create custom photo galler...
With jAlbum, you can create gorgeous custom photo galleries for the Web without writing a line of code! Beginner-friendly, with pro results Simply drag and drop photos into groups, choose a design... Read more
Quicken 2015 2.0.4 - Complete personal f...
Quicken 2015 helps you manage all your personal finances in one place, so you can see where you're spending and where you can save. Quicken automatically categorizes your financial transactions,... Read more
iMazing 1.0 - Complete iOS device manage...
iMazing (formerly DiskAid) is the ultimate iOS device manager with capabilities far beyond what iTunes offers. With iMazing and your iOS device (iPhone, iPad, or iPod), you can: Copy music to and... Read more
Xcode 6.0.1 - Integrated development env...
Apple Xcode is Apple Computer's integrated development environment (IDE) for OS X. The full Xcode package is free to ADC members and includes all the tools you need to create, debug, and optimize... Read more

Latest Forum Discussions

See All

View Source – HTML, JavaScript and CSS...
View Source – HTML, JavaScript and CSS 1.0 Device: iOS Universal Category: Utilities Price: $.99, Version: 1.0 (iTunes) Description: View Source is an app plus an iOS 8 Safari extension that makes it easy to do one key web developer... | Read more »
Avenged Sevenfold’s Hail To The King: De...
Avenged Sevenfold’s Hail To The King: Deathbat is Coming to iOS on October 16th Posted by Jessica Fisher on September 19th, 2014 [ permalink ] Just in time for Halloween, on October 16 Avenged Sevenfold will be launching | Read more »
Talisman Has Gone Universal – Can Now be...
Talisman Has Gone Universal – Can Now be Played on the iPhone Posted by Jessica Fisher on September 19th, 2014 [ permalink ] | Read more »
Tap Army Review
Tap Army Review By Jennifer Allen on September 19th, 2014 Our Rating: :: SHOOT EM ALLUniversal App - Designed for iPhone and iPad Mindless but fun, Tap Army is a lane-based shooter that should help you relieve some stress.   | Read more »
Monsters! Volcanoes! Loot! Epic Island f...
Monsters! Volcanoes! Loot! | Read more »
Plunder Pirates: Tips, Tricks, Strategie...
Ahoy There, Seadogs: Interested in knowing our thoughts on all this plundering and pirating? Check out our Plunder Pirates Review! Have you just downloaded the rather enjoyable pirate-em-up Plunder Pirates and are in need of some assistance? Never... | Read more »
Goat Simulator Review
Goat Simulator Review By Lee Hamlet on September 19th, 2014 Our Rating: :: THE GRUFFEST OF BILLY GOATSUniversal App - Designed for iPhone and iPad Unleash chaos as a grumpy goat in this humorous but short-lived casual game.   | Read more »
A New and Improved Wunderlist is Here fo...
A New and Improved Wunderlist is Here for iOS 8 Posted by Jessica Fisher on September 19th, 2014 [ permalink ] Universal App - Designed for iPhone and iPad | Read more »
Evernote Update for iOS 8 Adds Web Clipp...
Evernote Update for iOS 8 Adds Web Clipping, Quick Notes, and More Posted by Ellis Spice on September 19th, 2014 [ permalink ] | Read more »
Apple Names Ultimate Productivity Bundl...
Apple Names Ultimate Productivity Bundle by Readdle as the Essential Bundle on the App Store Posted by Jessica Fisher on September 19th, 2014 [ permalink | Read more »

Price Scanner via MacPrices.net

Updated Price Trackers
We’ve updated our Mac Price Trackers with the latest information on prices, bundles, and availability on systems from Apple’s authorized internet/catalog resellers: - 15″ MacBook Pros - 13″ MacBook... Read more
Mac Pros available for up to $260 off MSRP
Adorama has Mac Pros on sale for up to $260 off MSRP. Shipping is free, and Adorama charges sales tax in NY & NJ only: - 4-core Mac Pro: $2839.99, $160 off MSRP - 6-core Mac Pro: $3739.99, $260... Read more
13-inch 2.6GHz/256GB Retina MacBook Pros avai...
B&H Photo has the 13″ 2.6GHz/256GB Retina MacBook Pro on sale for $1379 including free shipping plus NY sales tax only. Their price is $120 off MSRP. Read more
Previous-generation 15-inch 2.0GHz Retina Mac...
B&H Photo has leftover previous-generation 15″ 2.0GHz Retina MacBook Pros now available for $1599 including free shipping plus NY sales tax only. Their price is $400 off original MSRP. B&H... Read more
21″ 2.7GHz iMac available for $1179, save $12...
Adorama has 21″ 2.7GHz Hawell iMacs on sale for $1179.99 including free shipping. Their price is $120 off MSRP. NY and NJ sales tax only. Read more
iOS 8 Adoption Rate Slower than iOS 7, 6, Hit...
Apple began pushing out iOS 8 updates to eligible devices around 1pm ET on September 17, 2014. However, unlike with iOS 7, which boasted a wide variety of differences from its predecessor iOS 6, in... Read more
LIkely Final Definitive OS X 10.9.5 Mavericks...
Apple has released what will almost certainly be the last incremental version number update of OS X 10.9 Mavericks (save for futire security updates) before OS X 10.10 Yosemite is released next month... Read more
Fingerprints, Apple Pay and Identity Theft Wa...
On Sep 9th, CEO Tim Cook unveiled Apple Pay, along with the new iPhone 6 and iWatch. Apple Pay is a newly developed technology that utilizes a near field communication (NFC) to enable customer... Read more
Amazon Introduces Two All-New Kindles
Amazon on Thursday introduced the 7th generation of its Kindle dedicated e-reader device: Kindle Voyage, its top-of-the-line e-reader, and the new $79 Kindle, with a 20% faster processor, twice the... Read more
Save up to $300 on the price of a new Mac wit...
Purchase a new Mac or iPad at The Apple Store for Education and take up to $300 off MSRP. All teachers, students, and staff of any educational institution qualify for the discount. Shipping is free,... Read more

Jobs Board

*Apple* Retail - Multiple Positions (US) - A...
Sales Specialist - Retail Customer Service and Sales Transform Apple Store visitors into loyal Apple customers. When customers enter the store, you're also the Read more
Project Manager, *Apple* Financial Services...
**Job Summary** Apple Financial Services (AFS) offers consumers, businesses and educational institutions ways to finance Apple purchases. We work with national and Read more
*Apple* Retail - Multiple Positions (US) - A...
Sales Specialist - Retail Customer Service and Sales Transform Apple Store visitors into loyal Apple customers. When customers enter the store, you're also the Read more
*Apple* Retail - Multiple Positions (US) - A...
Sales Specialist - Retail Customer Service and Sales Transform Apple Store visitors into loyal Apple customers. When customers enter the store, you're also the Read more
*Apple* Retail - Multiple Positions (US) - A...
Sales Specialist - Retail Customer Service and Sales Transform Apple Store visitors into loyal Apple customers. When customers enter the store, you're also the Read more
All contents are Copyright 1984-2011 by Xplain Corporation. All rights reserved. Theme designed by Icreon.