xeb.ai

14 November 2022

Future AGI Primitives

I know AGI predictions are rote but I’m going to write some down anyways. After all, intelligence is in many ways the act of prediction, right? Shouldn’t the field of artificial intelligence have an expectation — nay an obligation! — to continually refine predictions of its own future outcomes and capabilities?

Actually, let's make this list more of my own wishlist for AI and AI-adjacent technologies (which may or may not be viable businesses). These seem important in the intelligence explosion we find ourselves in.

Emissary - Imagine a system that acts as an authorized and cryptographically signed emissary for a known human being. The agent is sent off to perform actions. Those actions are the legal responsibility of the human owner (unless fault can be found with other systems). The Emissary is the primary, customized, and individual personality of an AGI for each human to interact with.
Eunice -- from William Gibson's book Agency -- is an example of an Emissary (albeit one "unbound" to any human).
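As a rough sketch of what a "signed" action might look like, here is a toy envelope in Python. Everything in it is hypothetical: the `sign_action` and `verify_action` names, the envelope fields, and especially the HMAC with a shared secret, which only stands in for the public-key signature a real Emissary would presumably need.

```python
import hashlib
import hmac
import json


def sign_action(owner_key: bytes, owner_id: str, action: dict) -> dict:
    """Wrap an action in an envelope tagged with the owner's key.

    Assumption: HMAC-SHA256 with a shared secret is a stand-in here;
    a real system would more likely use an asymmetric signature.
    """
    payload = json.dumps({"owner": owner_id, "action": action}, sort_keys=True)
    tag = hmac.new(owner_key, payload.encode("utf-8"), hashlib.sha256).hexdigest()
    return {"payload": payload, "signature": tag}


def verify_action(owner_key: bytes, envelope: dict) -> bool:
    """Recompute the tag and compare it in constant time."""
    expected = hmac.new(
        owner_key, envelope["payload"].encode("utf-8"), hashlib.sha256
    ).hexdigest()
    return hmac.compare_digest(expected, envelope["signature"])


key = b"hypothetical-owner-secret"
envelope = sign_action(key, "mark", {"verb": "request_photos", "target": "alex"})
tampered = dict(envelope, payload=envelope["payload"].replace("alex", "mallory"))
```

With this, `verify_action(key, envelope)` holds for the untouched envelope and fails for `tampered`, which is the property that lets another system pin an action on a known human owner.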
AI-UIs - Imagine interactions with future AGIs being based on dynamic visual user interfaces: UIs that are personalized, dynamic, and contextual. While language interfaces are quickly becoming pervasive (or at least folks talk about them being all the rage!), typing or transcribing text tokens just takes longer than pressing a single "CONFIRM" button.

Example: a system needs to prompt a user to pick a new appointment date. Rather than a classic date picker, an AI-powered UI model generates (given different contexts and calendar data) three buttons:
Data Stewards - Imagine an AI-powered authorization system that is trained over time on your data access requests. Any time an AIUI or foreign system needs access to your calendar, email, contacts, photos, memories, or physical access to your home -- a Data Steward system can infer the human's or agent's intent and authorization status. When in doubt, the Data Steward can leverage an AIUI to confirm access.

Alex-to-Emissary: "ask Mark to send me the pictures of our families at the beach"

(Alex's Emissary reaches out to my Data Steward)
(My Data Steward thinks this is fairly normal, but given the context of me in a bathing suit, thinks it is best to ask me for permission first and then send a note to Alex's Emissary)
(My Data Steward leverages an AIUI to ask me one simple question:)

Data Steward-to-Mark: "Do you want to share your Beach pictures from Saturday at Doheny Beach with Alex Jones?"
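The allow / ask / deny behavior in that exchange can be sketched in a few lines of Python. This is only an illustration under loud assumptions: the `AccessRequest` fields, the 0-to-1 scales, the thresholds, and the fixed `decide` rule are all made up here, with the rule standing in for whatever a Data Steward would actually learn over time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AccessRequest:
    requester: str
    resource: str
    sensitivity: float  # assumed scale: 0.0 = public, 1.0 = very private
    familiarity: float  # assumed scale: 0.0 = never seen, 1.0 = routine


def decide(
    request: AccessRequest, allow_below: float = 0.3, deny_above: float = 0.9
) -> str:
    """Return "allow", "ask", or "deny" from a hand-written risk score.

    Assumption: a learned model would replace this fixed formula.
    """
    risk = request.sensitivity * (1.0 - request.familiarity)
    if risk < allow_below:
        return "allow"
    if risk > deny_above:
        return "deny"
    return "ask"  # when in doubt, confirm with the human through an AIUI


beach = AccessRequest("Alex's Emissary", "beach photos", sensitivity=0.7, familiarity=0.4)
calendar = AccessRequest("Alex's Emissary", "calendar", sensitivity=0.2, familiarity=0.9)
print(decide(beach), decide(calendar))
```

The middle band is the interesting part: it is where the Data Steward hands off to an AIUI for a single yes/no question instead of guessing.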
Inferential Media Types - Our data needs to be progressively augmented with intelligent inferences. JPEG EXIF data should have binary embeddings or hashes of FaceNet IDs that the custodians of the image are able to parse out. Computer vision models need to augment the raw media files they process to progressively enhance them rather than keeping that knowledge separate.
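To make "binary embeddings or hashes" a little more concrete, here is a minimal Python sketch of packing a model's embedding into a compact text blob, plus a short hash of it, that could sit in a metadata field next to the pixels. The function names, the float16 encoding, and the 16-character ID are assumptions for illustration; which EXIF tag would hold the result is deliberately left open, and nothing here is FaceNet's own format.

```python
import base64
import hashlib
import struct
from typing import List, Sequence


def pack_embedding(vector: Sequence[float]) -> str:
    """Encode floats as little-endian float16, then base64 for text fields."""
    raw = struct.pack(f"<{len(vector)}e", *vector)
    return base64.b64encode(raw).decode("ascii")


def unpack_embedding(blob: str) -> List[float]:
    """Invert pack_embedding (values come back at float16 precision)."""
    raw = base64.b64decode(blob)
    return list(struct.unpack(f"<{len(raw) // 2}e", raw))


def embedding_id(vector: Sequence[float]) -> str:
    """Short, stable identifier derived from the packed bytes (assumed scheme)."""
    raw = struct.pack(f"<{len(vector)}e", *vector)
    return hashlib.sha256(raw).hexdigest()[:16]


face = [0.5, -1.25, 2.0]  # toy embedding; real ones have hundreds of dimensions
blob = pack_embedding(face)
restored = unpack_embedding(blob)
```

The design choice worth noting is that the inference travels with the file as a small, parseable blob, so the next model or custodian can read it rather than recompute it.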
Model Development Experience - Composing and using models in Python is not as accessible as JavaScript and web app development were 20 years ago. We need more. If Scale is All You Need, then AI infrastructure needs a ton of work! The big ones on my list are: apt-sourced NVIDIA drivers (including CUDA and cuDNN) that work out of the box, and standards for model storage (should all models be versioned in Docker so that inference and future training contain the right dependencies for the algorithm?). We also need composable models. I want to chain computer vision pipelines together, then add automated speech recognition later.
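Here is roughly the shape of composition I have in mind, sketched in plain Python. The `compose` helper and the three stage functions are hypothetical stand-ins that just shuffle dictionaries around; no real vision or speech model is being called.

```python
from functools import reduce
from typing import Any, Callable, Dict

Stage = Callable[[Dict[str, Any]], Dict[str, Any]]


def compose(*stages: Stage) -> Stage:
    """Chain stages left to right into a single callable stage."""

    def pipeline(sample: Dict[str, Any]) -> Dict[str, Any]:
        return reduce(lambda value, stage: stage(value), stages, sample)

    return pipeline


# Stand-in stages (assumptions): each adds one field and passes the rest along.
def detect_objects(sample: Dict[str, Any]) -> Dict[str, Any]:
    return {**sample, "objects": ["person"]}


def caption(sample: Dict[str, Any]) -> Dict[str, Any]:
    return {**sample, "caption": "a " + " and ".join(sample["objects"])}


def transcribe(sample: Dict[str, Any]) -> Dict[str, Any]:
    return {**sample, "transcript": "(speech recognized here)"}


vision = compose(detect_objects, caption)
# Later: bolt speech recognition onto the existing pipeline without touching it.
vision_and_speech = compose(vision, transcribe)
result = vision_and_speech({"frame": b"", "audio": b""})
```

Because a composed pipeline is itself a stage, adding ASR later is one more `compose` call rather than a rewrite.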
Assistive Operating Systems - Adept and ACT-1 are awesome looking! How can I get my entire operating system to be AI-assisted in a generalized way? Automatically suggesting completions for my actions or my text? Maybe we need an "accept suggestion" key on the keyboard.
Burst Compute - Massive infrastructure for extremely brief periods of time. I want 1,000 A100s for 15 seconds, and I don't want to pay a penny more. My Emissary and personalized Data Steward will authorize any devices, agents, or other intelligences in my digital and physical life with the necessary burst compute. It's not feasible to build a smart switch that runs the latest ASR and NLU/NLP models locally to carry on a conversation, but it is reasonable to give it the instruments (microphone and speaker) plus cloud access to the compute needed to carry on a conversation during setup. I imagine we'd also want the Data Steward authorizing the compute to be swappable. If my Dad is chatting away at my smart switch -- he should pay for that GPU time. Get your own personified plug, Dad!
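The arithmetic behind "not a penny more" is simple enough to write down. In this Python sketch the per-second billing model, the `burst_cost` and `charge` names, and the 3.00 USD per GPU-hour rate are all placeholders for illustration, not a real provider's pricing.

```python
from collections import defaultdict
from typing import DefaultDict


def burst_cost(gpus: int, seconds: float, usd_per_gpu_hour: float) -> float:
    """Cost of a burst billed by the GPU-second (assumed billing model)."""
    gpu_hours = gpus * seconds / 3600
    return gpu_hours * usd_per_gpu_hour


# Running tab per payer, so whoever authorized the burst gets the bill.
ledger: DefaultDict[str, float] = defaultdict(float)


def charge(payer: str, gpus: int, seconds: float, usd_per_gpu_hour: float) -> None:
    ledger[payer] += burst_cost(gpus, seconds, usd_per_gpu_hour)


PLACEHOLDER_RATE = 3.00  # hypothetical USD per GPU-hour, not a quoted price

charge("mark", gpus=1000, seconds=15, usd_per_gpu_hour=PLACEHOLDER_RATE)
charge("dad", gpus=8, seconds=600, usd_per_gpu_hour=PLACEHOLDER_RATE)
```

1,000 GPUs for 15 seconds is 15,000 GPU-seconds, or about 4.17 GPU-hours, so the bill scales with the rate rather than with a minimum rental window. Swapping the payer is just a different key in the ledger.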
Embedded Agents - Smart home device setup sucks. Can I get 10 different brands under the same roof and get them to get along? Will my Emissary and Data Steward be the primary ones responsible for keeping these devices in check? And if they do not -- how can I interrogate them?

Mark-to-Emissary: "do you see this smart switch? What is it? It won't turn on."

(Mark's Emissary reaches out to the Data Steward in the device -- there is no Emissary because the device, in this case, was not sent by a human)
(The smart switch's Data Steward says it has no WiFi password saved. It used to belong to a similar SSID but the old password doesn't match. The Emissary recognizes the MAC address and the old WPA/2 authentication keyphrase. This is indeed an old switch. The Emissary asks if the switch can have the latest WiFi credentials)

Data Steward-to-Mark: "This is an old Smart Switch that was disabled 2 years ago. Do you want me to share your latest network configuration with it and install it into the alarm system sensor array?"
Verifiable Robotics - This is the ultimate one for me. An action transformer turned loose in the physical universe. Truly multi-modal. Simulation can be like unit tests, but we need a primitive for integration tests: huge warehouses where robots can be tested offsite and virtually. Perhaps they have simulated models of my home (that do not physically exist), but the robots themselves are real and have to "take a spin around the house", ensuring they work with the other agents and systems deployed.