|
||||||
|
Wondering how U.S. federal nanotech tax dollars are spent? Obama’s first President’s Council of Advisors on Science and Technology (PCAST) review will be webcast live tomorrow, March 12. This review only occurs every two years so this is your big chance to see what the current administration thinks of the NNI. Thirty minutes are set aside for public comment. The webcast link should be available at whitehouse.gov tomorrow (Friday). I’m on my way to AGI-10, the general AI conference, in Lugano. If any readers are attending, let’s get together. Among other things, we’ll be unveiling a preliminary take on the AGI Roadmap (of which Foresight is a sponsor). The UK-based Institute of Physics (IOP) publishes, among other things, the journal Nanotechnology, one of the leading journals in the field, and has had special issues with papers from Foresight conferences gaoing back to the 90s. It was thus somewhat surprising, yet gratifying, to find them submitting quite a strongly-worded critique of practices in climatology that echo some of the concerns I’ve mentioned here about the impact of the shennanigans on the credibility of science as a whole:
It’s been snowing continuously here for about 2 days. The heaviest snows I’ve experienced in my life (for any significant amount of time) were an inch an hour, but this has been half that — amounting to a foot a day. If it were to keep snowing like this for a week, it would be a major emergency; if for a month, the area would become uninhabitable. As usual, people who disbelieve in global warming point at the record snow coverage extents this year and say they disprove it. As usual, people who are global warming supporters claim that global warming is causing the snow. GCMs tend to have forecast lower snow cover with rising global temperature, indicating a positive albedo feedback. That much seems clearly wrong. On the other hand, it’s not totally counterintuitive that rising energy input in the tropics could cause more evaporation, feeding more water into an atmospheric conveyor leading to more snow in the cold places. This could actually form a negative feedback in the climate system, putting an upper limit on global temperatures. There has to be some kind of such a limit, since something appears to stop the positive feedback loop (between temperature and CO2) that drives the exponential takeoff out of ice ages into interglacials. Anyway, snow. Whatever the reason, we’re about 3 months of what it’s doing right this minute from an ice age. I don’t have a clue how likely this is to happen how soon, but looking at the last million years of climate, ice age is the normal condition of Earth and interglacials are few and far between. We can only hope we’ve had the sense to develop real nanotech before we’re back in one. Robin Hanson comments on David Brin’s response to a New Scientist editorial.
In fact, it’s a lot worse than that. As far as I can tell, nobody talking about interstellar contact has a model even vaguely close to a reasonable analysis of the situation. Short form: these discussions are the equivalent of the natives of a Polynesian island deciding who shall be allowed to wave as the galleons heave into view. Our own technology, today, is getting close to detecting Earth-like planets around other stars, for heaven’s sake. The galleons see the island, not the waving. Scientific elites declaring moratoria on SETI transmissions are about as important to the future of the human race as whether we call Pluto a planet or a dwarf planet. The discussions are entirely about political dominance among scientists, and nothing to do with reality. Reality is that any alien race out there with whom we have any kind of physical contact at all is virtually certain to have (a) full-fledged nanotech, and (b) hyperhuman AI. Given these capabilities, if they want to find Earth-like planets anywhere in the area of space they would have the physical capability of travelling to, they will find them. Period. Doesn’t matter whether we are standing on the shore waving or not. Of course, that assumes they are interested in Earth-like planets in the first place. Most commentators on the subject seem to be stuck in E. E. Smith’s universe, worrying about whether the aliens who notice us will be the (kindly, academic) Norlaminians or the (evil, rapacious) Fenachrone. The aliens, wearing bodies like ours (or at least some form of animal life) will have spaceships and spacesuits and takeoff and land on planets and basically act like people on ocean-going boats. Star travel is expensive; it costs on the order of a ship’s own mass in equivalent energy to get it up to relativistic speeds. Any culture capable of that will be at least a Kardashev Type I civilization, and most likely a Type II. And the reason they’ll be doing star travel is to work their way up towards Type III. Any sentient creatures that actually get here will be nanotech-based robots, not water-based organisms. They won’t have spacecraft, they’ll be spacecraft. They will be unlikely interested in the carbon-poor mudballs of the inner solar system, but reap abundant carbon from the outer planets and carbonaceous asteroids to build Dyson-sphere-like structures around the orbit of Mercury. We simply aren’t going to see less sophisticated visitors due to the starship paradox: send a starship out now with all Earth’s current technological resources behind it, and then wait and send one in 50 years with full nanotech. The second one gets there first. We aren’t going to see any less ambitious visitors due to simple evolution: in a universe where the ultimate meaning of “carbon footprint” is the total mass of the superintelligent diamondoid robots you’ve built, spaceships burning cellulosic ethanol simply aren’t going to be anywhere near the fittest. Indeed, cultures that aren’t inherently aggressive and ambitious aren’t going to put the effort into sending out starships at all. The question is, what are they going to think of us, the thin layer of green slime coating an insignificant rock? If I were an aggressive superintelligent nanotech robot, I would tend to place the boundary between “people” and “raw material” at the boundary of aggressive superintelligent nanotech robots and everything else. I might — just might — make a sentimental exception for intelligent organic species such as my ancestors. “Such as” in this case means intelligent organic species which are on a clear track to building aggressive superintelligent nanotech robots. Or, of course, has already done so. If you really want them to show up as friendly neighbors, start working on that Dyson Sphere yourself. If, on the other hand, you’re a culture that has elevated cowardice (“Precautionary Principle”) to be its highest virtue … you’re just dirt. Let’s try to pull all the threads together, as futurists — which is the whole point here — and get some idea about when it might be reasonable to expect AI to show up. When I say AI I want to look at the entire diahuman range, so the answer would still be a range even if we were historians looking back on the process from the vantage point of the far future. I’ve claimed that “I think we have the techniques now to build an AI at the hypo/dia border, equivalent to a dull but functional human.” That doesn’t mean we have one now, or even that one is possible next year. What it means is that by the kind of techniques we can now use to program self-driving cars, we could, with a major development effort, program an AI that would be able to do as broad a range of that kind of task as a very dull human can, but which would need additional programming to do new tasks. Commenter Alex Kilpatrick put forward a cogent objection to the “AI is near” thesis, writing:
I agree with this strongly as a description of the state of AI today in general, but with one major reservation. Not entirely all of the AI field is nothing but clever programming. The AI programs that do the most impressive application tasks certainly are — because the efforts to build general learning machines are less than babies at the moment. The key to moving up from the hypo/dia border into the diahuman range is imitation. I’d guess that the state of the art would let us build a machine that would be able to watch someone sweeping a room and be able to sweep the same room with more or less the same series of strokes, being brittle to changes in the furniture positions and so forth. (Consider the kind of learning demonstrated in Ng’s helicopter.) Building an AI that could watch lots of sweeping and then be able to figure out on its own how to sweep a new room — without having been programmed with any knowledge of sweeping ahead of time — is the kind of thing we need to advance the state of the art. The difference is that in the second case the AI is inferring a model and a program from observations. But this is what 21st century AI is (already) all about — typically, today, inferring statistical models from reams and reams of observations, but at least tackling the right problem. The main thing that will determine the rate of advance is how much of the clever programming goes directly into end applications and how much goes into basic core learning. Concept formation, model building, program inference, and so on are a quantum step harder than parameter tuning in a known ontology. However, the math for that kind of thing is advancing, and the processing power to use techniques such as search and GAs is on its way in the next decade. I don’t think we’ll have a superintelligent AI by 2020; indeed, I don’t think we’ll even have one that can educate itself by reading Wikipedia. But I do think it’s at least a 50% chance we’ll have AIs that can learn something by a combination of imitation and careful verbal coaching. Rob Freitas has a new paper up:
Why we should suspect that the brain has a limited ability to recurse, but prefers to daisy-chain instead:
This final phrase of the classic Star Trek opening spiel had two problems with it, one as seen by people after the fact, and the other as seen by those who had gone before. As seen by earlier generations, the phrase “to boldly go” is a split infinitive. If E.E. Smith had written Star Trek in the ’20s, he would have written “boldly to go.” Avoidance of split infinitives, like many elements of grammatical style, was a cognitively expensive signalling behavior that advertised, essentially, that the speaker or writer was in the educated classes in an era where being educated meant you knew Latin. Infinitives in Latin are single words formed by inflection, rather than with a keyword such as “to” in English, so you can’t put an adverb in the middle of one. Avoidance of split infinitives lasted in “proper” English at least until mid-20th century, but had begun to fade (to slowly fade But there’s no real reason in English not to split infinitives. They are completely understandable, and often less ambiguous than alternative constructions. Apart from being a cognitively expensive signalling behavior, they had no value, indeed a cost in cumbersome and ambiguous sentences. Like many rules which were tacked onto the language by well-meaning grammarians, they were overly simplistic formalizations of a system, English grammar, which was and remains much deeper and more complex than anyone thought it was. The lack of the ability of “hand-coded” grammar to handle real language is most clearly displayed in the early attempts at machine translation, which were an abject failure. Only after 50 years of trying in natural language understanding, using statistically inferred models not formalized by humans, has serious progress been made (and there’s lots more progress needed before basic competence is achieved). The other, retrospective, problem with the Star Trek blurb, that the phrasing was considered sexist, was corrected in later incarnations to the more politically correct “where no one has gone before.” This, as it turns out, is another example where the unthinking application of a simple, formalized rule to “fix” something actually makes it worse. In the ’60s, the use of “man” in such a context was standard and unambiguous. It meant a human being (in fact, in my Websters from that era, “human being” is the first primary meaning, specialization to adult males being secondary). Star Trek, if you remember or have studied these things at all, was in its time one of the most progressive, liberal science fiction shows ever. It depicted a crew and implied a culture where barriers based on race and sex had been significantly lowered compared to the contemporary norm. So in the original, “man” meant “human.” Of course, the Enterprise went all over the galaxy seeking new life, new civilizations. The citizens of these civilizations had been there before. The distinction between “man” and these people is a fine one but one which can be reasonably be made consistent with the storyline.
Just as was Victorian proper grammar, politically correct speech patterns are primarily a cognitively expensive signalling behavior. They have exactly the same import: the speaker is educated, intelligent, and ambitious enough to pay the cognitive price to consciously modify the vernacular. But as we have seen, PC speech is often yet another case of simplistic human-formalized rules, applied in a context-free way. They fail on their own terms — implying just the wrong thing, as above — when context shifts. In other words, simple human-formalized rules applied blindly to something as complex as grammar are brittle, a property they share with bureaucratic rules and AI programs. Human ethics are similar to human language in their depth and complexity. They are famously just as difficult to capture in simplistic formalism. Indeed, given the examples of PC speech, it’s quite arguable that grammar and speech are a proper subset of ethics. You can’t even reason about whether the Star Trek example is right or wrong without understanding language at a probably better than state of the art level. And it’s certain that all the subtleties of ontology and epistemology are part of ethics, just as they are of language. AI is just, in the past decade or so, beginning to get traction in the natural language field beyond the simple human-written formal rules stage. As for ethics, we’ve just barely gotten into the simple human-written formal rules stage. But if you want a preview of what machine ethics will ultimately look like, study modern natural language processing. Foresight Institute Feynman Prize winner Dr. Ralph Merkle, perhaps better known to Nanodot readers for his nanotech work, has just won the IEEE’s Hamming Medal along with Martin Hellman and Whitfield Diffie:
Read the whole article for the interesting details and politics behind the work, and a great photo of all three back in 1975 (lots of hair). Congratulations! —Chris Peterson
Although we have talked mostly about robotics in terms of how AI has been advancing, it’s instructive to look at developments in the other subfields as well. Natural Language Processing is among the oldest. Turing’s classic paper from 1950 laid out the ability to converse in ordinary, unstructured text as an unequivocal test of the point a machine could be said to think. Although much can and has been written on the validity of the Turing Test as specified, it is clearly true that a computer with the ability to converse fluently in written and spoken English would be enormously more useful than the computers we have today. I think it’s also reasonably clear that most people would assume that there was “somebody home,” i.e. begin to impute intelligence and a self to such a computer. The paradigm shift in NLP has been a result of two things: the increasing willingness (and ability) of AI researchers to use statistics and other numerical methods from the scientist’s toolkit, and the increasing size and avalibility of databases and corpuses and the processing power to subject them to intense analysis. The amateur AIer today can go on the web and obtain for free enough data and programs to cobble together an NLP system better than anything that existed in 1990. The processing power in a high-end workstation is good enough for a reasonable amount of research, but a couple more orders of magnitude will help immensely. What statistical methods have done, in essence, is replace the hand-written grammars that characterized classic-era AI NLP. These are probabilistic, trained on huge corpuses, and are considerably more robust in use than the old ones. On the other hand, they don’t reach up into the heights of semantics as well. I’d claim that there’s not a NLP system today that understands its entire vocabulary as well as SHRDLU did its. The reason is that for its tiny vocabulary, SHRDLU could have a hand-written piece of code for each concept, and thus have a real understanding, in some sense, of the concept. However, this will change over time. To begin with, people will simply write code for the most important concepts. People will come up with schemes to form new code for new concepts from fragments of old code and/or search methods like genetic programming. Current leading-edge NLP systems (most of them proprietary, AFAIK) are surprisingly good at talking about whatever it is they actually know about, i.e. have a deep semantic model of, as long as you’re literal and prosaic (and expect them to be the same). I think it’s a toss-up whether automatic programming of semantics makes it to hypohuman border this decade — but AI with hand-coded semantics such as Siri seems likely to be ubiquitous, and competent, by 2020.
It’s pure palladium, 8 nm wide, made at the University of Birmingham’s Nanoscale Physics Research Laboratory. h/t Nanowerk In E.E. Smith’s famous Lensman series, the galaxy is the battleground between two races of superintelligent beings, the (good) Arisians and the (evil) Eddorians. When I listen to people who worry that we are about to create a superintelligence which will take over the world, I get the impression they’ve come from reading “Galactic Patrol” and think that we are on the verge of disastrously creating an Eddorian unless we buckle down quick and figure out how to build a friendly Arisian instead. In the books, the superintellects had lots of ESP powers but we can dismiss those. The actual intellectual capability they were imputed to have was the ability to predict. Prediction is of course the sine qua non of intelligence, but the Arisians were able to predict, e.g., five years ahead of time, that a certain man would be sitting in a barber’s chair and a kitten would jump onto his lap, jostling the barber’s arm and giving him a scratch. All from the laws of physics and the knowledge of initial conditions. There are many reasons why this is simply, completely, totally, always forever and truly impossible. First of all the laws of physics are quantum and have a built-in probabalistic uncertainty. By the same token, it is impossible to know the initial conditions of any substantial portion of the universe to any very high precision: measuring a particle necessarily changes its state in a way do not completely know. Second, huge parts of the phenomena of interest, a many levels of ontology, are in dynamic systems that are subject to chaotic behavior. The Butterfly Effect reigns not only in weather, but in markets and politics and epidemiology and computers (one different bit out of a gigabyte can completely change the program’s behavior) and every human mind. Computers are a particularly hard case of this. Very basic theorems of computer science tell us that one cannot in general predict what a program will do without actually running it. This is fine if your superintellect has plenty more processing power than the computer in question, and can emulate it. But the closer the computer you’re trying to predict comes to having your own processing power, the more likely it will surprise you. A weird special case of this is that you can’t even predict a universe if you yourself are part of it, because you are a computer with processing power equal to yourself. (This, BTW, is where our notion of free will comes from: our world models must necessarily exempt our self-models from their general basis in determinism.) You could cheat and force yourself to act in the future according to a list of actions you prepared today, but you wouldn’t be acting all that intelligently; and you wouldn’t be acting with free will, either. A more obvious case is simply a world with two (well-matched) superintellects, in which at least somewhere they are in competition, maybe even just a friendly game of chess. In a game between two identical chess computers, each gets to see one ply deeper into the future than the other one did. Neither can know enough to guess what the other one is going to do for sure. In a world with lots of superintellects, no one will be able to predict any detail on which they compete. The debate held at Foresight 2010 between Robin Hanson and Mencius Moldbug on the subject of futarchy is now online at Vimeo. Watch it online or download it:
Many many thanks to Monica Anderson for doing the video.
My current research in AI, such as it is, is an attempt to build a system that’s capable of understanding the above quote. It’s from the middle of a book, and it is much much harder to understand, fully, than you might think. What I intend to do here is to unravel the process by which someone reading the book could be said to understand it. Largely the concern is about what kind of mental structures are being built and what structures must have been built by reading the previous half of the book for the passage to do what it does in the mind of the reader. Without further ado, let us jump into the quote, which starts at the beginning of a paragraph:
There are two sources for the comprehension of this clause. First is the preceding paragraph, where a fight is described. A scene and script have been built up, like a movie in the mind. In particular, one man is holding a girl (who is struggling to escape) and another is trying to tie her feet. She kicks the second man, and that is the blow that’s being referred to. Unlike a cinematic movie, however, much that would be evident on the screen has been left out. The specific positions of the bodies, the clothing in some cases, and many aspects of the background have been left to the imagination. In other words, the “movie” is a sequence of abstractions. It is in no sense simply a pile of predicates, however. When I read this, I come away with a semi-visual motion script, such as could be used to orchestrate a re-enactment by action-figure dolls, even though the text doesn’t come close to specifying the actual positions or motions I imagine. The second source is the reader’s memories of pertinent experiences, either of watching fights or having been in them. In the multi-level abstraction structure that’s being built, by and large, at least in the hands of a skillful writer, the things that get mentioned are the things that you’d pay attention to if watching the scene. It’s well established, for example in studies of eyewitness acounts in criminology, that people confabulate what happens between such points in their memory of actual events, much less from verbal stories. So to that extent, the structure of a story reflects that of memory. If you’ve ever taken a hard blow to the solar plexus, you’ll have a much deeper understanding of this passage than someone who hasn’t. I have, and the sensation is unique; nothing else in my experience feels the same or has the same effects. If you have, note that among the few descriptions of clothing that were provided was that the girl was wearing riding boots. At a higher level, the scene is part of an attempted abduction of the girl by the men. On this level, the reader is on tenterhooks to discover whether the abduction will succeed, given the girl’s spirited and at least partially efficacious resistance.
Syntactically, this is a bit of a garden path; we expect it to be a conjunct of the previous predicate until we see the comma. It turns out to be a participle introducing the second clause. It appears to be for the benefit of those readers who have not experienced solar plexus blows. It describes the effect well enough to follow the action sensibly, but doesn’t really capture the experience. This points out that there can be different amounts of actual understanding going on in different readers each of whom would claim to have understood the passage: there can be ties to emotions, sensations, memories, and/or mental models in various combinations and strengths.
Perkins is the second man, and his staggering back is completely predictable from the description of the action so far. In fact, it’s predictable that staggering back is part of the process of his collapsing, which isn’t, and doesn’t need to be, stated explicitly. The ability to predict is one of the key elements of understanding, so we can propose that there is a model of collapsing after being knocked out (or struck in the solar plexus) that abstracts away from any particulars about the individual (or his specific position) that allows the extrapolation of the “movie,” if need be. It’s the instrument-board that is the new item. In order to understand this, the reader has to pull into play a much broader background structure than heretofore. The action is taking place in the control room of a spaceship, and the “instrument-board” is its control panel. One is reminded of a programming language variable being looked up in a containing context after being found unbound in the local one.
Now we see that there are multiple disconnected levels of abstraction, as well as disconnected items of physical description, that need interpolation. We hear about the arm and the lever, which is local and concrete. We can imagine an arm striking a lever and pushing it. We still don’t know anything about how big the board is, where the lever is on the board, whether the board is horizontal, vertical, or tilted. We don’t know where Perkins is with respect to the pilot’s (and co-pilot’s?) seat(s). On the other hand, we do know much higher-level things about control panels, and power levers (I, for example, call to mind typical airplane cockpits as well as the spaceship control rooms in SF movies I’ve seen.) Although various things about the spaceship have been mentioned before in the book, no good description of the control room has been given; we have to assemble this as some useful level of abstraction as we read this passage.
This is the key development, not only of the passage, but of the entire book. Note that without the model that the reader will have built up by that point, the phrase means virtually nothing. You might well think that there is a place where drinks are served on the ship.
The really remarkable thing about this phrase is that it throws implications across virtually every level at which the book is to be understood. At the physical level, it describes the closing of a circuit and the application of voltage to a piece of metal. At the technological level, using the (fictional) understanding of space drives built up before, that means that the ship will be placed under very heavy acceleration. Back at the physical bodies level in which we were understanding the fight before, it means that the parties will be thrown to the floor and unable to move. The fight will be at least suspended. At something closer to the plot level, the parties, pinned to the floor by acceleration, will be unable to stop the ship until the fuel runs out, stranding them far away in space. This converts them from abductors and victim, where the conflict is all interpersonal, to fellow lifeboat passengers facing a common doom. There is room for various character development as they adjust to the shift in circumstances. In a larger context, from the outside it will seem as if the abduction succeeded. This will force the girl’s fiance to give chase in his own spaceship (already a hackneyed plot by 1928, to be sure, but thereby all the more predictable for the reader). But the fact that the ship will fly at top acceleration until its fuel is exhausted implies that the succeeding action will be far-removed from the familiar terrestrial scenes it has taken place in so far. In fact it converts the story from one of personalities and struggle in familiar circumstances (a la the Illiad) to a true voyage of the imagination (like the Odyssey). If you are want to understand things deeply, you typically want to call in comparable things which can illuminate them by analogy. In this case the arm on the lever is like the tornado in Wizard of Oz (or of course the storm in the Odyssey); it not only throws the protagonists into a strange new world, but motivates their subsequent adventures with the quest to get home. There is NO knowledge representation and inference scheme in the NLP field today that has even a snowball’s chance of capturing all this. But a human reader with a good grounding in the classics can see that this sentence is the turning point and spark of the whole book on something like five different levels simultaneously. That’s quite a kick.
In something of an anticlimax, Smith is keeping the reader up to date with the physical model of the ship, just in case someone wonders why, having gone down to land, it didn’t keep going down when the juice was turned back on. It had come down backwards, hanging on its thrust. Yet another model — and level of abstraction. Phaedon Avouris, winner of the Feynman Prize in 1999, is head of the nanoscale science and technology group At IBM, which has recently reported significant advances in synthesizing transistors from graphene using conventional lithography methods. IBM Demonstrates Graphene Transistor Twice as Fast as Silicon Graphene transistors promise 100GHz speeds Graphene Transistors that Can Work at Blistering Speeds and the Science paper, 100-GHz Transistors from Wafer-Scale Epitaxial Graphene What does this all mean? Basically, they have overcome a couple of substantial hurdles on the way to a carbon-based electronics, namely the bandgap issue and the ability to fab at wafer scale. They still have a way to go: they need to bring gate length down by a factor of 10 or so to be in the range of silicon, and probably a few more hurdles and a lot of just plain legwork as well. But if the research goes through to development, and the development goes through to manufacturing, we’ll have chips that are about two-and-a-half times as fast as the corresponding ones in silicon. The bottom line, for my money, is that Moore’s Law is safe (in the sense that it will continue to hold true) for another decade at least. I don’t see this as being a huge spike ahead of Moore’s Law, since graphene has a lot of catch-up to play, but in the long run it probably has more upside potential in speed and size, especially if/when they can get those nanoribbons atomically precise. The first AI blog was written by a major, highly respected figure in the field. It consisted, as a blog should, of a series of short essays on various subjects relating to the central topic. It appeared in the mid-80s, just as the ARPAnet was transforming over into the internet. The only little thing I forgot to mention was that it didn’t actually appear in blog form, which of course hadn’t been invented. The WWW didn’t appear until the next decade. It appeared in book form, albeit a somewhat unusual one since it was, as mentioned, a series of short essays, one to a page. It was, of course, Marvin Minsky’s Society of Mind. Of course, you’re reading a blog about AI right now. The difference is that that was Minsky, and this is merely me. If you haven’t read SOM, put down your computer and go read it now. Good. You’re back. Here’s why SoM is relevant to our subject of whether and how soon AI is possible:
In other words, here’s a comprehensive theory of what an AI architecture ought to look like that is the summary of the lifework of one of the founders and leaders of the field, and yet no one has seriously tried to implement it. (When I say serious, I mean put as much effort into it as has gone into, say, Grand Theft Auto.) (There has been a serious effort to implement the theoretical approach of the CMU wing of classical AI, namely SOAR.) Part of the reason for this is that SoM is in some sense only half a theory:
… but SoM doesn’t have a lot to say about what the individual functions are or how implemented, outside a few examples. Since AI has for the past few decades concentrated on immediate results, most of the work has been on parts of the problem that could be described as stuff that would be inside a single agent, or at most an agency. A good example of this happened a few years ago with the winning of the DARPA Grand Challenge and thus the development of the self-driving car. A few months after that happened, I was having a conversation with an AI researcher at a conference. I maintained that the difference between the results of the first and second races — nobody got more than a mile or so, and then a couple years later several cars finished the whole 130-mile course – represented real progress. He pooh-poohed the idea. All the techniques used in the cars had been previously known and published, he said. All that had happened was that they had been integrated together into a working system. I think this attitude goes a long way to explaining the lack of work on SoM and other overall cognitive architecture theories. But as I reasoned previously:
SoM represents a theory of how the control might work. Where does that leave us? Can we simply take Minsky’s books and papers and build an AI with all the existing narrow skill programs acting as agents? Hardly. There’s a lot of work to be done, and probably several new Good Tricks left to be found. The bottom line, though, is that we are not facing a blank wall. We are facing a corridor with a sign reading “This way to the egress.” Indeed we are partway down the corridor already; robotics and self-driving cars have required the development of integrated cognitive architectures along the lines that will probably lead to success. Note that Brooks’ subsumption architecture had a lot in common with SoM. So there is at least a case to be made that we are into the home stretch. Of course that’s where the race really heats up and all the excitement happens… So far, in making my case that AI is (a) possible and (b) likely in the next decade or two, I’ve focused on techniques which are or easily could be part of a generally intelligent system, and which will clearly be enhanced by the two orders of magnitude increase in processing power we expect from Moore’s Law by 2020. (Note — we certainly don’t have to wait till 2020 to find out. Existing hardware is well into the usable range, probably for less than $1M. But you don’t get too many researchers, and no hobbyists, doing their research on machines like that today. You will in 2020.) To make a heavier-than-air airplane fly, you need an engine. If you have an airframe with lift-to-drag ratio r, stall speed s, and weight w, and a propellor with thrust efficiency e, you need an engine with power p=swr/e to fly. Power<p, no fly. Power>p, fly. Both of the major American flying machine efforts understood this. Langley spent huge effort developing light, powerful engines. The brothers Wright built their own aeroengine from scratch in their bicycle shop. The difference was, the Wright brothers knew an extra Good Trick, which was how to control the plane in the air once it was flying. So to develop a working AI, we need the power, which we don’t think is going to be a problem. We need the lift, which is the kind of techniques found in narrow AIs and discussed above. And finally we need the control. What I just said is an example of reasoning by analogy. To an extent much greater than usually realized, most cognition and reasoning is based on analogy. When you perform a physical skill, the specific sequence of sensory and motor signals is never exactly any of the ones that happened during practice; but they’re close enough that the mapping is straight-forward. This is something that is well-known to the AI mainstream:
The particular kind of reasoning by analogy that would make an associative memory machine work well can be called analogical quadrature. This is the form of problem done most famously by Melanie Mitchell’s Copycat program: you have three things A, B, and C, and you want to find a fourth D such that A:B::C:D. In the associative memory scheme, you need to do not the actual action you did in the memory, but the action that fits the current situation the way the remembered action fit the remembered situation. As a simple example, if the remembered action was done by someone else, the parallel could be mapping things so that the action is done by you this time. In other words, analogical quadrature enables imitation. If you can somehow represent your concepts as points in an n-dimensional space, analogical quadrature is falling-down easy: D=C+B-A in ordinary vector algebra. Of course, sometimes the mapping into n-space is problematical, and we are thrown back on symbolic methods such as those of the FARGitecture. Those have their own problems, essentially the same ones as any symbolic AI: the operations and ontology in, e.g., Copycat are all idiosyncratic and hand-coded, and there’s no clear way to build a learning machine that extends them automatically. I’ll go out on a limb and guess that the ultimate solution will involve elements of both extremes. Search will be needed both to find new operations for symbolic formulations, and to find appropriate mappings into n-space for the subsymbolic ones. A few key insights — new Good Tricks — will be necessary to unify the known methods and give us a solid understanding of, and engine for, analogical quadrature. That’ll be a huge step towards general AI. AI researchers in the 80s ran into a problem: the more their systems knew, the slower they ran. Whereas we know that people who learn more tend to get faster (and better in other ways) at whatever it is they’re doing.
The processing power necessary to to that kind of parallel matching is high, but not higher than the kind of processing power that we already know the brain has. It’s also not higher than the processing power we expect to be able to throw at the problem by 2020 or so. Suppose it takes a million ops to compare a sensed object to a memory. 10 MIPS to do it in a tenth of a second. A modern workstation with 10 gigaops could handle 1000 concepts. A GPGPU with a teraops could handle 100K, which is still probably in the hypohuman range. By 2020, a same priced GPGPU could do 10M concepts, which is right smack in the human range by my best estimate. Associative memory gets you a lot. You don’t have to parse an unknown object for algorithmic retrieval. You don’t have to come with some one-size-fits-all representation and/or classification scheme. Indeed, each object in memory can have its own representation if necessary or useful. It gets better. The memories aren’t all, or even mostly, objects. They’re typically actions. Let’s suppose the actions are represented as situation-action-resulting situation triples — something like Minsky’s trans-frames. Then we can use the associative memory to
There was an attempt to do this kind of thing in mainstream AI under the name “case-based reasoning” a couple of decades ago, but it appears to have foundered for several reasons, not least of which was the inability to do heavy-duty parallel matching on extensive memory sets. There are a number of things that need to be added to the scheme for it to be useful and robust, like embedding it in a hierarchical, multiagent architecture, the ability to do analogical quadrature, and the ability to find useful representations. But that’s for another post. Bayer (the same company that makes the aspirin) is now beginning to manufacture multi-walled carbon nanotubes in industrial quantities. The pilot plant will produce 200 tons per year, and the market is expected to grow at 25% per year. The MWCNTs are for materials use, meaning mostly fiber-reinforced composites, e.g. airplanes, tennis racquets, arrows, and the like. The major advantages over conventional polymers / fibers is that the CNTs are stronger and conductive (both electrically and thermally) — producing a plastic that is more like metal in many ways, but still much lighter. The conductivity is supposed to be comparable to copper, i.e. good enough to use as wiring in many applications. Looking at the data for CNTs as a polymer additive, the major effect on mechanical properties was to make them less stretchy (and about 10% stronger), while having a major effect on conductivity properties. Nobody has yet, as far as I know, managed to figure out how to make a composite that has the really high tensile strength possibilities of the raw nanotubes. Alternatively, CNTs in light metal matrices such as aluminum or magnesium seems to have significant possibilities. Time will tell — but there’s still a major advance to be made. The individual CNTs in the mix are on average 8 or so walls, 15 nm diameter, and over a micron long (i.e. an aspect ratio of at least 60 and probably in the hundreds). |
||||||
|
Copyright © 2010 the Foresight Institute - All Rights Reserved |
||||||