Difference between revisions of "Compute architectures"

From apm
Jump to: navigation, search
m
(added an page intro == Delineation ==)
Line 1: Line 1:
 
{{Stub}}
 
{{Stub}}
 +
 +
== Delineation ==
 +
 +
This page is about compute architectures aimed for physical (on silicon) implementation <br>
 +
This page is not about various exotic computing substrates (not: https://en.wikipedia.org/wiki/Unconventional_computing) <br>
 +
This page is not about purely theoretical machines with no means of mapping them to an on chip design. <br>
 +
Although the lines can blur a bit on that. <br>
  
 
== Mainstream ==
 
== Mainstream ==

Revision as of 13:26, 2 September 2024

This article is a stub. It needs to be expanded.

Delineation

This page is about compute architectures aimed for physical (on silicon) implementation
This page is not about various exotic computing substrates (not: https://en.wikipedia.org/wiki/Unconventional_computing)
This page is not about purely theoretical machines with no means of mapping them to an on chip design.
Although the lines can blur a bit on that.

Mainstream

There's plenty of resources out on the web so not much details here.

  • Von Neumann architecture (today's 2024 CPUs)
    – Issues here are the enforced serial nature of data processing and the "von Neumann bottleneck".
  • Graphical Processing Units (GPUs still highly specialized on triangle processing, but changing as of 2024)
  • FPGAs (Field Programmable Gate Arrays)
  • Emerging: NPUs & TPUs (for AI) – (not yet more general neuromorphic computing)

Dedicated compute hardware specifically designed for pure functional languages (lambda calculus)

Term normalization by parallel asynchronous term rewriting

Older works - lambda calculus evaluating hardware

(wiki-TODO: Add details (SECD))

Green arrays (related to stack based programming)

Compared to systolic arrays:

  • Scale and generality: Green Arrays nodes are more general-purpose and typically deployed at a larger scale on a single chip.
  • Asynchronous vs. synchronous: Green Arrays operates asynchronously, while systolic arrays are typically synchronous.
  • Programming model: Green Arrays uses a Forth-inspired model, which is quite different from the typically fixed-function nature of systolic arrays.
  • Data flow: Systolic arrays have a more rigid, predetermined data flow, while Green Arrays allows for more flexible data movement between nodes.
  • Application scope: Systolic arrays are often optimized for specific algorithms, while Green Arrays aim for broader applicability.

Green Arrays Bootstrapping Process

Initial State:
The chip starts with one active node (often called the "boot node").
All other nodes are in a dormant or unconfigured state.

Propagation:
The boot node begins by configuring its immediate neighbors.
It loads them with basic functionality, essentially "waking them up".

Cascading Configuration:
The newly configured nodes then participate in configuring their own neighbors.
This process cascades across the chip, with each node potentially configuring others.

Dynamic Programming:
As the configuration spreads, nodes can be programmed with different functionalities.
This allows the chip to configure itself for various tasks dynamically.

Adaptive Behavior:
The configuration process can adapt based on the task at hand or the state of the chip.
This allows for efficient use of resources and fault tolerance.

Collective Intelligence:
The end result is a chip where the collective behavior emerges from the interaction of many simple, individually programmed nodes.

Reconfigurable Asynchronous Logic Automata (RALA)

A physical computing that aims to match to the 3D spacial constraints of our real world.
By Neil Gershenfeld (MIT Center of Bits and Atoms)
(wiki-TODO: Add more details eventually.)

Systolic arrays

(wiki-TODO: Add details)

Cellular automata

These are usually not seen as practical general purpose compute architectures.
While some are Turing complete (e.g. Conways game of life)
they seem not particularly suitable/practical for general purpose computations.

Typically they feature simple rules per cell. Thus expressive capabilities are limited.
But they feature complex emergent behaviour which is making them interesting to study.
Obviously they are limited to 3D lattices in physical implementations.

Reversible computing architectures

Se also: Reversible computing & Well merging

External links