I’m a postgrad. Tradition dictates dictates that I have a “topic”, by which is meant a research area, for which I possess an abiding, obsessive love, which love I think it best to express by writing long love-notes to said discipline which I will proceed to hide in the library basement. This is what we call a “dissertation”. I don’t have one right now. This will cause consternation in my confirmation hearing. Here, watch me attempt to rectify this in public.
My undergraduate degrees are in mathematics and human geography. My professional experience is in software development for large-scale data visualisation. My interests are algorithmic music, open-ended creative systems, the economics of sustainability and technological innovation, and in general questions of multi-agent learning. “Collective dynamics”, if you like. The flux of hidden orders that constrain the chaos.
Dynamics of collective learning, with an emphasis of what this means at the economic end of the spectrum.
What kind of stochastic process is multi-agent learning in human beings? How can we model it? Artificial chemistry? Wolpert-style COINs? Feldman-style “Turing gas”? Pattern-matching string transformations? Information theoretic bounds on agent model updating? A network representation of hypothesis formation? For what its worth, my intuition is that something like a combination of artificial chemistry and a statistical model of pattern matching could give some insights into a toy model
If the aggregate system is, as seems likely, unpredictable, what bounds does knowing about an underlying stochastic process place on system evolution? How different would technology be if we “ran the tape twice”? Is there an underlying topology to what innovations can be fostered? Surely the limits imposed by every individual agent’s learning imposes certain constraints on what overall structures can be evolved.
If the adaptive process of innovation is constrained by the structure of adaptive human learning, how is it constrained by the underlying physical reality? Reality, viewed from the perspective of making and testing hypotheses about it, is not a homogeneous state space with constant reward, but must possess a fitness landscape that favours some combination of truth and ease of applicability. (Solar panels work best in the desert, anti-gravity machines don’t work anywhere, Newton’s equations of motion are more readily deducable than relativistic ones at the velocities at which we commonly operate, ready availability of fossil fuels favours polymer-based construction materials etc - do you get my drift?) Can we capture that in some sense?
The attraction of thinking at this degree of abstraction is that it seems like a contender to remove the computational intractability of agent-based models - we can hope that processes of innovation are approximately static, that processes of diffusion are approximately like the standard contagion-on-a-network models, that learning rates by something like a reinforcement-learning convergence rate, and that we can get a model that will show us how variable and weird actual innovation processes are, and how abstract information flows are constrained by reality, but avoid having to simulate decision processes for millions of human agents. Of course it would be an odd system, dominated by black-swan events, innovations that are unlikely for any one individual to make but then rapidly propagate. Average over the whole earth, no individual is likely to invent the transistor, but a global economy that fosters the possibility of such events will crack it sooner or later, say. Closer to Paul Ormerod’s stochastic shocks than Ising’s spin glasses.
On that latter, physical technology note, I suspect that the fitness landscape of economic innovation would have something to do with informational constraints of human learners, but also of the geophysical landscape - what sources of free energy are available? How readily may these be transmitted? There is a lot of work with this in material stocks and flows analysis, and also in the field of ecology. It seems this is related to your “free energy” link, though a more common term and slightly different term from the engineering/ecology literature is “exergy” - thermodynamically available energy. (Approximately - how much energy Laplace’s demon can on average extract from a system with noise.) The free energy thing seems to rest a bit to heavily upon equilibrium hypotheses for my taste, but this underlying computational mechanics idea that a combination of available energy plus information about how likely it is to be available is one of the underlying drivers for learning/replicating systems, that seems reasonable.
Interaction of adaptors on different time-scales - evolutionary versus cultural time-scales, dynamics that are hard or easy to learn, frequent and infrequent event-types... can any regularities survive such heterogeneity? Convergence theorems applied to nonstationary targets.
So, questions this approach seems useful for are: what are the transitions paths to non-carbon-intensive energy systems? How can we quantify the “disruptiveness” of a technology? Can we identify unfilled technological niches in this way? What would a society based on alternative energy forms look like? Which industries are dead in the water?
What is a good research étude toward these large issues?
How is it that a scalar value, price, gets to subsume so much information about return upon an investment? Decision theory says that, given a certain valuation of different returns upon investments we can value any given risk. Portfolio theory tells us how to attain the optimal balance of risk and return etc. Risk, of course, has more dimensions than just static distributions of returns, but we can somehow ignore that a lot of the time. Is it that markets are structured to support a scalar pricing of assets in such a way that we can ignore systemic black swan events? Are they systems that minimise the frequency of Black Swan events? If so, what does that say about them? How much endogenous uncertainty can we pave over in this way? can it be quantified, the limits to making options quantifiable?
It seems that the go-to-guy here is Didier Sornette. I’ve Been reading Malevergne and Sornette’s “Extreme Financial Risks”, it it seems, at a first glance, to a have a chapter devoted to each of my sub-questions here. Sigh.
I’m working on a model of financial analyst hiring so I can have at least one Agent-based model under my belt. Details soon, under Groysberg Traders.
How much of economic growth is attributable to the increase in usage of resources? How much to population/workforce growth? And how much is due to increasing technological sophistication? Or to other things? How can the process of exponential increase in our GDP be made comprehensible?
I’m short of data with which to explore problems. Welcome to the ubiquitous ICT epoch though, I can definitely get access to some of those classic data sets online to bootstrap my enquiries.
Economics is so successful in part because it is a self-quantifying system. Prices are already, definitionally in commensurable units. What brilliance, to launder so much complexity through such a narrow bottleneck. Anyway...
Twitter firehose
Patent citation networks (these are available and reasonably well annotated)
MMOG play data might be available and is an amazing source of information here.
Wikipedia articles and their references (readily available)
source code of large collaborative projects (Linux or BSD kernel, openoffice, python, Perl, GCC etc)
This is possibly a low-hanging fruit for me - I’ve got a fair bit of experience of parsing, and SCM-wrangling. But is software engineering a good proxy for physical engineering? Can I parse other technical standards in the same way? (I know a few engineers - must ask them)
Brand registration records. (Thinking of Holland-style tagging hypotheses here)
Journal article cross-references. This is quite well-studied.
Estimating number of SKUs as a surrogate for divisions of a modern economy a la Beinhocker (lots of research into this because of Long Tail theories, though the primary data is rarely included - might chase this.)
Bank or other firm lifetimes
Stocks and flows analyses of the economy (These are big in the ecological economics literature, once again, without primary data usually)
We have Census records also
...and public data providers such as infochimps.
What else is digitised? Consumer purchase data, traffic flow data?
How about - what small subsystems in here are documented well?
Groysberg’s Chasing Stars model benefitted from an industry which goes to the trouble of collecting data on itself for him. Do we have any equivalents in areas I am actually passionate about?