[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference ricks::dechips

Title:Hudson VLSI
Notice:For Digital Chip Data - CHIPBZ::PRODUCTION$:[DS_INFO...]
Moderator:RICKS::PHIPPS
Created:Wed Feb 12 1986
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:701
Total number of notes:4658

569.0. "21164PC announced?" by HGOVC::DAVIDLEE () Fri Sep 13 1996 11:02

T.RTitleUserPersonal
Name
DateLines
569.1Not announced yet.ROCK::BANNONFri Sep 13 1996 11:2617
569.2PKO is fullROCK::BANNONFri Sep 13 1996 22:4725
569.3ODIXIE::MOREAUKen Moreau;Technical Support;FloridaFri Sep 13 1996 23:2630
569.4Not planned coverageROCK::BANNONSat Sep 14 1996 21:5913
569.5ODIXIE::MOREAUKen Moreau;Technical Support;FloridaSun Sep 15 1996 20:4131
569.6analysts?NAMIX::jptFIS and ChipsWed Sep 18 1996 05:548
569.7More money...ROM01::OLD_CIPOLLABruno CipollaWed Sep 18 1996 07:197
569.8HLO Oct 16 4-5pmROCK::BANNONFri Sep 27 1996 14:3121
569.9ROCK::BANNONFri Sep 27 1996 14:3228
569.10NPSS::NEWTONThomas NewtonMon Nov 18 1996 15:3454
569.11I agree with Terry Shannon's bottom line....PERFOM::LICEA_KANEwhen it's comin' from the leftTue Nov 19 1996 09:505
569.12What price point would change this???PATE::LEWISMichael LewisTue Nov 19 1996 10:4815
569.13Gartner does FX!32PCBUOA::KRATZTue Nov 19 1996 10:498
569.14NPSS::NEWTONThomas NewtonTue Nov 19 1996 11:3440
569.15NPSS::NEWTONThomas NewtonTue Nov 19 1996 12:0111
569.16It's the marketing, not the productWRKSYS::SCHUMANNTue Nov 19 1996 14:0512
569.17Consumers buying alpha? Probably not. But at work....PERFOM::LICEA_KANEwhen it's comin' from the leftTue Nov 19 1996 14:3828
569.18PCBUOA::KRATZTue Nov 19 1996 14:474
569.19This is all I meant....PERFOM::LICEA_KANEwhen it's comin' from the leftTue Nov 19 1996 15:0612
569.20mr. bill, put the Kratz-filter on your notes extractorTIGNES::HOBBSCongrats to the Ignoble Peace Prize winner! (http://www.eecs.haTue Nov 19 1996 15:131
569.21NPSS::NEWTONThomas NewtonTue Nov 19 1996 16:114
569.22AXEL::FOLEYhttp://axel.zko.dec.comTue Nov 19 1996 17:2910
569.23Then tell us "not mainstream"...ATLANT::SCHMIDTSee http://atlant2.zko.dec.com/Tue Nov 19 1996 20:4410
569.24Future != presentWRKSYS::INGRAHAMAndyWed Nov 20 1996 07:343
569.25NT 4.0 + Alpha Native + FX!32 + OpenGL, nice piecesPERFOM::LICEA_KANEwhen it's comin' from the leftWed Nov 20 1996 10:3129
569.26TALLIS::DARCYGeorge Darcy, TAY1-2/G3 DTN 227-4109Wed Nov 20 1996 11:5039
569.27NPSS::NEWTONThomas NewtonWed Nov 20 1996 15:0311
569.28FORTY2::PALKAThu Nov 21 1996 07:2218
569.29The processor itself is rather bland....PERFOM::LICEA_KANEwhen it's comin' from the leftThu Nov 21 1996 09:1634
569.30RLTIME::COOKThu Nov 21 1996 10:3710
569.31PERFOM::LICEA_KANEwhen it's comin' from the leftThu Nov 21 1996 13:0414
569.32PCBUOA::KRATZThu Nov 21 1996 13:083
569.33R4640 used in webtv?NPSS::WATERSI need an egg-laying woolmilkpig.Mon Dec 02 1996 11:0314
569.34Microprocessor ReportROCK::BANNONTue Dec 03 1996 09:4816
569.35Typo or Freudian slip?CPEEDY::FLEURYTue Dec 03 1996 10:5511
569.36PCBUOA::KRATZTue Dec 03 1996 11:236
569.37KOALA::BRIGGSTue Dec 03 1996 11:3928
569.38Microprocessor Report biased towards Intel?NPSS::NEWTONThomas NewtonTue Dec 03 1996 11:4916
569.39I thought *they* were 2 years behind *us*...SMURF::STRANGESteve Strange, UNIX FilesystemsTue Dec 03 1996 11:5615
569.40If not Digital, who?PERFOM::LICEA_KANEwhen it's comin' from the leftTue Dec 03 1996 12:1218
569.41re: .33 (Which Mips where sidetrack....)PERFOM::LICEA_KANEwhen it's comin' from the leftTue Dec 03 1996 12:146
569.42TALLIS::DARCYGeorge Darcy, TAY1-2/G3 DTN 227-4109Tue Dec 03 1996 12:346
569.43PCBUOA::KRATZTue Dec 03 1996 13:1112
569.44KOALA::BRIGGSTue Dec 03 1996 14:1013
569.45May 1997 Byte Magazine articlePERFOM::LICEA_KANEwhen it's comin' from the leftMon May 05 1997 14:165
    
    CPUs: Alpha Arrives at the Desktop, by Peter J. Bannon.
    _Byte_, May 1997, vol 22, no 5, pp59.
    
    								-mr. bill
569.46Anyone scanned it?GRIFUN::OLOBARDIMarco- Tech.OEM Mktg-ItalyTue May 06 1997 13:484
Hi, 
	any way to get a scanned version somewhere? 
Ciao ciao
					Marco
569.47Good work, let's see lots more.BBPBV1::WALLACEjohn wallace @ bbp. +44 860 675093Tue May 06 1997 14:112
    Since Peter J Bannon is a DIGITAL employee, and is credited as such in
    the article, can he post it for use internally anyway ?
569.4821164PC in ByteROCK::BANNONTue May 06 1997 19:28230
    Below is the final draft that I sent to Byte.  Some editting
    was done by Byte, but it should be close to the published
    version.
    Pete
    

Alpha 21164PC -- Leadership Performance for Windows NT Desktop 
Systems

By Pete Bannon


Smaller,  cheaper and faster are the watchwords for semiconductor
manufacturers today as they scramble to satisfy the 
enormous appetites of multimedia, CAD, and data manipulation 
applications on the desktop.   Corporations and small businesses are 
using desktop systems for video conferencing, voice synthesis, and 
enterpise-wide data access.   Home users are surfing the Web, 
running sophisticated video games and creating home movies on 
their PCs.  As each breakthrough drives the imagination toward new 
horizons, this trend is not likely to abate anytime soon.

Designed with these applications in mind, Digital Semiconductor's 
new Alpha 21164PC microprocessor delivers more CPU cycles and greater data
bandwidth in a smaller package than any other microprocessor on the market. 
For example, the 533-MHz 21164PC has a smaller die size than the 200-MHz
Pentium with MMX and provides significantly higher performance.   The 21164PC
outperforms the Pentium chip by more than two times in SPECint95 performance
and by more than three times in SPECfp95 performance.  The 21164PC, which was
co-designed by Digital Semiconductor and Mitsubishi, supports an astonishing
2.1 BIPS (2133 MIPS) and 1066 MFLOPS. The estimated performance of the 21164PC
configured with 2MB of external cache and 125ns main memory is 14 SPECint95 and
17 SPECfp95. These characteristics and the 21164PC's price points make an
Alpha processor solution the clear leader for Windows NT PCs. 

Full Windows compatibility provides an additional 21164PC edge in the
desktop market.  In addition to a large number of native Windows NT
applications, DIGITAL FX!32, a breakthough software translation technology,
gives Alpha system users access to the full suite of 32-bit x86 Windows and
Windows NT applications, running with high performance. 

Further, as the first Alpha processor to implement Motion Video Instructions
(MVI), the 21164PC dramatically increases the Alpha processor advantage over
competing products in motion video applications.   For example, the
21164PC supports full-frame-rate DVD playback and high-quality video
conferencing in software, eliminating the need for dedicated multimedia
hardware and reducing overall system cost. 

Innovation, Small Die Size Target PC Market

A full implementation of the Alpha architecture, the 21164PC 
leverages the design of Digital Semiconductor's Alpha 21164, a 
processor that has maintained performance leadership in the 
industry since its introduction.  The 21164PC, depicted in Figure 1, draws from
this technology and incorporates innovation and a smaller die size to 
achieve its advanced design. 

Implemented in Digital Semiconductor's 0.35-micron CMOS process, 
the 21164PC features a die size of  8.5 mm by 16.2 mm and contains 
3.5 million transistors.    This small die size  (a 30% reduction from 
previous Alpha processor implementations) enables significant 
manufacturing cost savings, which translate directly to more 
affordable PCs for a broader market.   

The Alpha Architecture

The Alpha architecture is a 64-bit load and store RISC architecture 
that is designed with particular emphasis on speed, multiple 
instruction issue, and multiple processors and focuses on  
uncompromised support for many operating systems, including Windows NT, 
Digital UNIX, Linux, and Open VMS.   All registers are 64 bits long 
and all operations are performed between these registers.  Alpha 
instructions are 32 bits long and memory operations are either load 
or store operations of data that is 8, 16, 32, or 64 bits in length.   

The 21164PC takes full advantage of the Alpha architecture in a 
quad-issue implementation featuring a 7-stage integer pipeline and  
9-stage floating point pipeline.  The 21164PC has a large 16KB 
instruction cache (Icache) and features a bandwidth of 8000MB per 
second to the instruction issue unit.  This exceptional data transfer 
capacity plus an aggressive instruction pre-fetch scheme keep the 
chip's pipelines full.  The 21164PC's Icache pre-fetches 96 bytes 
ahead of the current program counter, providing significant 
performance improvements in long code sequences where instructions 
can be fetched 300% faster than is possible without pre-fetching. 

Streamlined Instruction Issue and Execution

The 21164PC's instruction unit is comprised of an instruction buffer, 
slotter, and issue unit.  The simple instruction issue design 
maximizes the 21164PC's clock frequency with little impact on the number of
instructions that can be issued in a cycle.  The microprocessor's instruction
buffer holds two sets of four instructions, facilitating the chip's quad-issue
operation.  The buffer optimizes the flow of instructions into the slotter unit
by removing bubbles from the pipeline that are caused by taken branches.   The
slotter attempts to assign four instructions to the pipelines each cycle and
refills when all the instructions have been assigned.  The issue unit allows
the instructions to execute after assuring the availability of the required
system resources. 

The integer execution unit contains a register file and several 
functional units in four stages of two parallel pipelines.  The 
pipelines contain differing sets of units with the 64-bit adder, logical and
load units being common to both pipelines.  The 21164PC's instructions execute
in one cycle with the exception of loads and conditional move instruction, which
require two cycles.  In addition, the 21164PC incorporates a special hardware
feature that allows the common code sequence, COMPARE followed by DEPENDENT
BRANCH, to execute in one cycle instead of two, thereby streamlining
application performance. 
 
The new MVI instructions -- PERR, MAX/MIN and PACK/UNPACK -- are implemented in
the integer unit, saving die space and reducing cost.  This efficient
implementation delivers an impressive 400% improvement in MPEG-2 compression
for the very low cost of 0.6% of the 21164PC's area.   This design is possible
because the 21164PC's 31 64-bit integer registers provide sufficient storage
to support the chip's issue bandwidth of 533 million MVI instructions per
second concurrently with 533 million additional integer instructions per
second.  In addition, the supporting instructions used by MVI, including
compares, adds, shifts, loads, and stores, already exist in the integer unit,
eliminating the need for additional instruction logic on the 21164PC chip. 

The 21164PC's floating point unit allows for exceptional performance in
floating point-intensive applications such as 3-D graphics on the desktop. 
The unit is made up of two 64-bit execution units -- an add pipeline that
executes all floating point instructions except multiply, and a multiply
pipeline.  Both units are fully pipelined and have a latency of four cycles.   
To maximize performance further, the floating point unit incorporates two
dedicated floating point load data pipelines that allow floating point load and
store instructions to be executed in parallel with floating point operates. 

Memory Unit Delivers High Throughput 

The 21164PC's memory unit, which features very high data 
bandwidth, maximizes operational efficiency and CPU utilization.   
The 8KB data cache (Dcache)-- a dual-ported, fully pipelined, non-
blocking cache -- allows the 21164PC to move rapidly through 
programs that process large amounts of data.   Because the Dcache is 
non-blocking (up to 21 loads can miss), the processor can continue 
to operate uninterrupted when cache misses occur.  In addition, the design
interleaves cache fills from memory with processor operations.  These design
characteristics give the 21164PC a significant advantage compared with other
processor designs.   For example,  the peak data bandwidth of the 21164PC
operating at 533 MHz is ten times higher than the peak bandwidth of a Pentium
operating at 200 MHz. 

Further, due to the 21164PC's robust write buffer, more things 
can happen simultaneously in memory.  The write buffer has six 32-
byte entries, with each entry providing an opportunity to collapse into a 
single transaction multiple writes to the same address.

The 21164PC's L2 cache controller helps maximize application 
performance by streamlining L2 cache accesses.  The cache 
controller, which is also non-blocking, does this by ordering requests 
to the L2 cache to achieve an optimal balance between bandwidth 
utilization and access latency.  

PC-Compatible Motherboard 

Figure 2 depicts the  block diagram of the  AlphaPC 164SX motherboard, a sample
design incorporating the 21164PC.  Featuring an ATX form factor and outstanding
price/performance, the motherboard is ideally suited for Windows NT desktop
systems. The design database is available, at no charge, from Digital
Semiconductor. 

The AlphaPC 164SX motherboard satisfies all ATX requirements for hole 
placement, component spacing and component height.  The six-layer 
AlphaPC 164SX module can be installed in any ATX enclosure and 
uses standard ATX power supplies.  

The AlphaPC 164SX also offers extensive flexibility,  allowing PC 
manufacturers to configure systems that satisfy a broad range of 
applications, cost effectively.  The AlphaPC 164SX's 413-pin ZIF 
socket accepts 400-MHz and 533-MHz 21164PCs, giving PC 
manufacturers two high-performance processor choices.    The 
motherboard also accepts L2 caches sized from 512KB to 4MB and 
operating at speeds of 66MHz to 133MHz, offering  a spectrum of 
data handling capabilities.    

The 21174 Core Logic Chipset that is configured on the motherboard 
provides high-speed access to memory and PCI peripheral devices.  
The 21174 features support for 16Mbit or 64Mbit SDRAM memory in 
configurations from 32MB to 512MB.  Using the chipset's PCI interface, the
motherboard accommodates a full range of I/O device configurations.   
The AlphaPC 164SX contains two 64-bit and two 32-bit PCI slots.  Operating at
33 MHz, the 64-bit PCI interfaces provide up to 260MB per second of I/O
bandwidth, satisfying the high-performance demands of I/O devices such as
graphics, ATMs and RAID. 

The 21164PC is also fully compatible with existing chipsets, such
as Digital Semiconductor's 21172 for the Alpha 21164, allowing
manufacturers to design-in these products.

To enable easy, cost-effective configuration of traditional ISA 
devices, the AlphaPC 164SX motherboard incorporates a Cypris CY82C693 PCI to 
ISA bridge and two ISA slots.   This part provides on-board USB, IDE and
keyboard/mouse interfaces. 

Availability

Alpha 21164PC microprocessors will be available from Digital Semiconductor
in Q2 of this year.  Two versions of the 21164PC -- 400-MHz and 533-MHz -- will
be offered for approximately $1/MHz. 

For more information regarding these and other Digital Semiconductor Alpha
products, contact your local semiconductor distributor or call the Digital
Semiconductor Information Line: United States and Canada 1-800-332-2717;
outside North America +1-508-628-4760.  Or visit the Digital Semiconductor
Alpha Web site: www.alpha.digital.com. 

 
********************************************************************
Biography for Peter Bannon

                           Peter J. Bannon
                         Consulting Engineer
                        Digital Semiconductor
                    Digital Equipment Corporation

Peter J. Bannon is a hardware consulting engineer with Digital 
Equipment Corporation.  He has participated in the design and/or 
verification of several CISC and RISC microprocessor chips and was a 
member of the Alpha 21164 architecture team.  He joined Digital in 
1984 after receiving a Bachelor of Science in computer system design 
from the University of Massachusetts.

Bannon holds three patents for VAX CPU designs and has filed six 
patent applications for the Alpha 21164 microprocessor.

                                 ###
    
569.49HELIX::SONTAKKEWed May 07 1997 15:483
>     125ns main memory 
    
    Is that a typo ?
569.50DECCXL::OUELLETTEmudseason into blackfly seasonWed May 07 1997 20:481
Perhaps it's total round trip latency...
569.51ROCK::BANNONThu May 08 1997 14:453
    Its not a typo, the memory latency from load to use is at least 125ns.
    
    
569.52AMCFAC::RABAHYdtn 471-5160, outside 1-810-347-5160Thu May 08 1997 16:221
Issue them reads early and often.
569.53DECCXL::OUELLETTEmudseason into blackfly seasonThu May 08 1997 16:544
It's cache as cache can for this stuff.
Them 60 nS SIMMs are just one step in getting data.
At 500 MHz, 4 way issue allows 200+ other instructions
to go in one load from memory latency (in theory at least).
569.54better than most stuff we sellWRKSYS::HOUSEKenny House, Workstations EngineeringThu May 08 1997 17:113
    Note that 125 ns latency to main memory is pretty darned good. 
    
    -- Kenny House