Blog - Stuart Mathews

Details: Category: Blog; By Stuart Mathews; 02.Aug; 02 April 2025; Last Updated: 08 September 2025; Hits: 741

Philosophy

I've created a systematic approach that aims at assessing certain aspects about papers that I'm reviewing.

The first phase (Context and Understanding) is meant to cut to the core of the underlying research that is being presented. It is also hoped that by analysing the papers in this fashion, justification and reflection can be undertaken when considering individual aspects.

The second phase (Methodological Issues) aims to consider the threats to the research's validity with a view to spotting problems within the research process itself.

The example column is based on a review done in Reviewing A Fast Learning Algorithm for Deep Belief Nets.

I	Phase 1: Context and Understanding
#	Aspect	Description	Example
1	Title	Title of the research paper	A Fast Learning Algorithm for Deep Belief Nets
2	Topic	The general topic of the paper	Restricted Boltzmann Machines
3	Topic Details	More additional details, if needed, to better articulate the description	A description of an algorithm that, when applied to an existing DBNs (which comprise RBMs), will improve its performance.
4	Year	Year of publication	2006
5	Research Area	The particular research area this topic falls under	Deep Learning (Deep Neural Networks)
6	Type of research paper	Is it a survey or a topic-specific paper	Specific topic in the research area
7	Author(s)	The authors of the paper	Hinton, G.E., Osindero, S. and Teh, Y.-W
8	Research Question	The research question the paper is addressing	How can the performance of DBNs be improved? How can the effects of 'explaining away' be eliminated in DNNs?
9	Problem	The problem the paper is going to solve	The phenomenon in DBNs of 'explaining away'
10	Problem Analysis	More detailed analysis of the problem
11	Research Deliverables, Promises and/or Objectives	Deliberables/objectives typically suggest that data and a specific technique must be used to	Show how using 'complementary priors' removes explaining away. Derive an unsupervised learning algorithm that uses complementary priors Describe a hybrid neural network that uses associative memory and 3 hidden layers Use an algorithm on a neural network to prove the algorithm's effectiveness The performance of the model is FAST and ACCURATE
12	Aim of research	In what perspective does the paper aim to solve the problem?	Describe
13	Research Aim Analysis		To describe how an algorithm works to solve the phenomenon of explaining away, which reduces the performance of DBNs. It also explains how the algorithm can solve the problem, and explains what the phenomenon of 'explaining away' is
14	Philosophical worldview	What view do the authors have, eg, Objectivist	Postpositive (Objectivist), Pragmatist
15	Philosophical world view analysis	Details, justification for the indication of a particular philosophical view.	The author uncovers why DBNs have lower than expected performance, which is due to a phenomenon called 'explaining away'. In this respect, they figure out what it is and how to fix it, thereby revealing a previously unknown approach to this problem. This, therefore, the researchers employ a postpositivist approach to knowledge, accepting that there is knowledge/truth that must be discovered, which is what this research does. Equally, the research is also very pragmatic as demonstrated by the implementation of the algorithm in a DBN that is shown and can be of general use, particularly in this case in classificying ot generating digit characters, but such that this approach could be applied to any generative model with any data.
16	Research Design Philosophy (Empirical or Non)	What is the general philosophical approach to the design of the research undertaken in this paper?	Empirical/Quantitative (Gathering of observed/experienced data)
17	Research Designs	Specific research design based on the underlying philosophy of the research design (Empirical or Non-Empirical)	Design Science Research (Empirical) Experimental (Empirical) Statistical Modelling (Empirical) Algorithms design (Non-Empirical)
18	Research Design Analysis		The research uses experiments to showcase the results of the algorithm developed. It is therefore based on experimental and empirical observation. The Neural network used is in the testing inherently a statistical model that results in quantitative outcomes, which serve to model its results. As the approach is primarily the showcase of both the description and implementation of the algorithm, as applied to an existing approach (DBNs), it could be argued that this research is also a design science piece, as it is the development and performance of the resulting model that is explored.
19	Research Methods	The process or manner in which research is conducted. Each paper has a single method of research.	Experiment (Quantitative measurement)
20	Analysis of Research Methods	What does the method consist of, and what are its characteristics?	The research uses experiments that use existing character digit datasets as input the the model, and the results are then evaluated. The model produced quantitative results that represent the model, which therefore makes this research an application of quantitative measurement based on experimenting with the model (which is enhanced by the algorithm developed in this research).
21	Reasoning Process	Where are the conclusions drawn from? Typically, interpretivist approaches use induction to start with the data to derive a theory/understanding.	Deduction - Theory first, data later
22	Reasoning Process Analysis	Where are the conclusions drawn from? Data or theory	The underlying theory of 'explaining away' is discussed, and an algorithm that aims to eliminate it is implemented in a model that is tested using a dataset (MNIST). In this way, a theoretical idea is tested or proven using data. Therefore, this is a deductive process initiated with a theory and supported by an implementation of an algorithm based on that theory and applied to a model, which is evaluated with data to test whether the reasoning behind the theory is correct.
23	Answer/Solution to the problem	How does the paper solve the problem that is stated?	DBNs should be pretrained layer-by-layer, followed by fine-tuning to substantially improve learning.
24	Analysis of the Answer/Solution to the problem	Characteristics of the solution.	DBNs should be pretrained layer-by-layer using an unsupervised algorithm, followed by a fine-tuning supervised algorithm to substantially improve learning. Use a fast learning algorithm based on complementary pairs to configure DBNs for training
25	Modes of inquiry	Scientific: Based on research procedure and empirical testing (Objectivist and subjectivist) Non-scientific: Belief, tradition, intuition	Scientific
26	Mode of Enquiry Analysis		The science research, basing its approach on statistical methods, specifically the actual use and implementation thereof, uses a neural network, trained with the developed algorithm that is tested and shown to produce observable indications that the algorithm solves the problem presented in the research. The approach is objective, as it uses existing datasets to show the effectiveness of the algorithm (when applied to the model). The algorithm is also by definition repeatable, making this easily testable and therefore the results are not based on subjective or varying procedures. This means this process can be automated.
27	Data (acquired)	1. What is the acquired material? 2. What is the data collection process? 3. What are the characteristics of this data?	MNIST dataset: 10,000 character digits, grey-scale 32x32 images used to train the neural network (pre-trained with a new algorithm) The input dataset for the model testing is the MNIST dataset of character digits, which is a repository of 2D images that are well-known and used by researchers for the classification of character digits. The output of the model is numerical data that indicates/predicts the classification of the input data belonging to specific classes of digit characters.
28	Analysis of Research Techniques/Tools	What are the techniques/Tools and their characteristics?	Contrastive divergence algorithm Gibbs sampling (Hidden Markov Monteo carlo method) Greedy-layer-by-layer algorithm (complementary pairs) DBN (Deep Belief Network) Various learning algorithms (Backpropagation, SVMs, squared error and online updates, LeNet5 CNN, cross entropy, etc.) Generation of an image using a learnt model The main technique is experimentation using a neural network with 3 hidden layers and using the developed pre-training algorithm to test how well it removes the 'explaining away' in order to improve the network's inference performance. The MNIST data set is used as training data for the network. The algorithm that is developed is applied to a DNN model (DBN), and it is then tested by experimenting on the model to see the performance that results. The results in an error rate of 1..25% in comparison to the closest rival, which is SVM at 1.4%.
29	Research techniques (data collection and analysis)	How data is gathered, analysed and inferences drawn	Experiments, Statistical analysis, Neural Networks
30	Information (Analysis)	1. What does the data tell us? 2. What meaning is established by analysing the data? 3. What is the nature and the characteristics of this information?	The MNIST data is processed using the neural network, resulting in output from the neural network (model). The results from the neural network show that using a pre-training algorithm that configures/trains each layer using complementary pairs improves the performance of DBNs, i.e reduces the error discrepancy in predicted vs actual outputs
31	Knowledge (general)	1. How has the information obtained been useful, particularly in a wider sense? 2. What is the nature and characteristics of this knowledge?	The phenomenon of 'explaining away' that occurs in DNNs (of stacked RMBs) restricts their performance. Using complementary pairs to configure/train each layer to establish initial weights removes 'explaining away' and results in a better-performing neural network.
32	Parent/Origin	The origin or parent paper that this paper was referenced. For example, a Survey paper	A survey of deep neural network architectures and their applications
33	Peer reviewed?		Yes
34	Relevance, Contribution, Originality and Novelty	1. What explicit theoretical dilemma and conceptual underpinning (not just gap) in CS has inspired this research? (See problem) 2. How would truth or fact be evaluated in this research? 3. Which concepts/models/assumptions from the discipline (CS) are integrated into my methodology? 4. What are the implications of this research for the wider field? (see research impact) 5. How do the results challenge or support existing approaches to truth/ fact and theory?	A key aspect is that Hinton et al have identified and understood exactly what the problem of explaining away is, and so were able to create an algorithm to circumvent it. The improvements to the Performance/Learning of/neural networks as a result of this new algorithm improve the performance of all DBNs, and therefore have a great/wide applicability to all domains that use DBNs. The results of the paper are very generalizable. Another particularly interesting aspect is that the paper shows a way to determine what the model learnt by generating an image based on the learnt weights to 'see' what and how it learned the dataset.
35	Popularity	Refer to aspects such as the vastness of citations and collaborations, etc	High
36	Popularity Analysis	What indicates the popularity? Number of citations?	IEEE reports 3888 citations while ACM reports 3280 citations. This suggests that this is a very popular piece of research. There is also credence given to the fact that the researcher (Hinton) has an important influence on the AI research community in general. IEEE: https://ieeexplore.ieee.org/document/6796673 ACM: https://dl.acm.org/doi/10.1162/neco.2006.18.7.1527
37	Research Conclusions	Analysis of the conclusions. 1. Are they subjective? 2. Do they have credibility?
38	Causation vs Correlation analysis	How are variables, causation and correlation dealt with in the research?	Variables: Learning approach (with or without greedy layer-by-layer algorithm) MNIST dataset Neural network configuration After using various types of comparative learning algorithms in comparison to the research's approach (which uses the greedy-layer-to-layer pre-training algorithm), the same dataset is used (MNIST) throughout, therefore only the approach to learning changes. This means each model's discrepancy error is evaluated until the lowest value is found to see which model causes the discrepancy value to be the lowest. The neural network configuration is unchanged.
39	Researcher's notes	Notes that were taken during the reading of the paper. (NB: see existing notes written on/in the papers also)	It's interesting to think of an observation as a result (situation/response) of possible causes (weighted inputs/events) This might be a way to model dealing with stimulus and response pairs as events
40	Researcher's TODO	These are actions that the researcher would like to take as a result of reading the paper	Learn more about generative models It seems that I'm more aware of discriminatory models. Find out what maximum likelihood and contrastive divergence learning are What is a function of a discrete variable? Understand what the problem of explaining away really is
II	Phase 2: Methodological Issues (Subjectivity/Objectivity)
	Threats to validity
41	Construct Validity	Analysis of the approach to measuring the construct under investigation. Any threats, oversights, assumptions or naivety or other risks that might affect the construct validity of this research.	No obvious flaws
42	Internal Validity	Analysis of cause-and-effect within the research (Truth in the research). Any threats, oversights, assumptions or naivety or other risks that might affect the Internal validity of this research?	See below
	42.1) Research Correctness	How was subjectivity removed from the outcome, or how is objectivity ensured? E.g.. How reproducible is the research and why?	Objectivity The same constant data is used as was used by others (MNIST) The research uses an objective measure of performance (inference error) using the dataset shows that its error rate is better with this model than with previous models. Subjectivity/Specificity No obvious flaws
	42.2) Research technique	How is subjectivity removed using the technique? What aspects of the research technique are too specific and risk degrading/compromising the real truth?	Objectivity: Algorithms used in this research are repeatable and inherently automatable. This means all parts of the process, i.e, data, model, and algorithm, are non-varying in nature and therefore can be replicated/verified by third parties. The comparison of alternate models' performance on the NMIST dataset is suitable for evaluating how the pre-trained model's performance compares to those models that do not use it. The research techniques fit the requirements of this research. Subjectivity/Specificity: The research techniques are limited to only used on 2D image data (pixels) Only a 3-layer neural network is used
	42.3) Research techniques vs research question	Are the research techniques appropriate given the research question?	Objectivity: Varying the application of the learning algorithm while keeping other parameters constant, i.e the common data (MNIST) and the design of the neural network design means that it is simple to evaluate the effect of varying the single variable, i.e, the application of the algorithm, making only the algorithm the independent variable. This supports the research question as the neural networks' output/performance (error function) of the classification task directly indicates if the neural network worked better than other results from other models that had not used the pre-training algorithm. Subjectivity/Specificity: The performance is only measured using data that reflects 2D character images. Larger images or more complex images are not assessed. Only a specific configuration/design of the DBN (3-layer is used) to remove the effects of 'explaining away'
	42.4.) Conclusion vs methods	Are the conclusions appropriate given the methods used? What risks jeopardise the truth of the conclusions?	Objectivity: Using experimental results based on empirical testing, observation, and comparison supports the conclusion that the research's specific approach is better than the other approaches that were tested. Subjectivity/Specificity: Experiments were based on only 2D image data (pixels) so the conclusions can only be representative of character-based image data
43	External Validity	Analysis of generalizability (Truth in real-life). Any threats, oversights, assumptions or naivety or other risks that might affect the External validity of this research?	There is no evidence presented that this approach has or will generalise well to wider applications.
44	Data Validity	Analysis of the data gathered to represent the construct being described. Any threats, oversights, assumptions or naivety or other risks that might affect the Data validity of this research.	See below
	44.1) Data subjectivity (specificity/narrowness)	How is data subjectivity eliminated? How specific is the data and its use/nature itself, limiting the impact of the research or the purported truths that the research suggests by using it.	Objectivity: Image data for a neural network classification task is appropriate for evaluating the learning of a neural network for the classification of this data against known classification labels. The research data is also a well-known dataset that is often used for testing classification performance in models, and so it is appropriate for this type of research. Subjectivity/Specificity: Only the MNIST dataset is used, so the data used to represent the solution presented in this research is limited. This limits the research's outcomes and approaches to dealing with small geometric character recognition.
	44.2) Data vs Research Question	How well does the data support the research question?	Objectivity: Varying the application of the learning algorithm while keeping other parameters constant, i.e the common data (MNIST) and the design of the neural network design means that it is simple to evaluate the effect of varying the single variable, i.e, the application of the algorithm, making only the algorithm the independent variable. This supports the research question as the neural networks' output/performance (error function) of the classification task directly indicates if the neural network worked better than other results from other models that had not used the pre-training algorithm. Subjectivity/Specificity: Only 2D character digits, pixel information is used to show how the techniques in the research improve inference performance.
45	Summary of general risks to validity	General concerns, risks, limitations and assumptions.	The paper is very technical It relies on an understanding of many different ideas and processes such draw deeply on existing knowledge. Those inexperienced researchers may find it difficult to validate construct and internal validity without being well acquainted with the theory, algorithms and approaches discussed.
	45.1) Credibility concerns	Are there noticeable flaws or limitations in the research? This can be high-level level such as a lack of references, large gaps in bibliography, dependency on specific authors (lack of balance) or other missing or inadequate observations/aspects that make the research less credible.	Objectivity: There are gaps in the referenced papers; however, as this paper tests a new algorithm using experimentation and comparison with other approaches, the literature is less influential. In this respect, the literature is relatively objective. Subjectivity: The research is 19 years old (2025) and techniques here could be outdated or have been improved by subsequent research possibly making this research deprecated.

To take the information gathered and put it into a more presentable form.

An example of this approach is Reviewing A Fast Learning Algorithm for Deep Belief Nets. The outline of which looks like this:

This research review approach is currently under development, but you can use it if you'd like a place to start.

Details: Category: Blog; By Stuart Mathews; 31.Jul; 31 July 2025; Last Updated: 02 August 2025; Hits: 505

Situation Detection

Since Thoughts on Bayesian Networks I've been thinking about using Bayesian networks as a means to identify situations.

In How Bayesian Networks learn, various conditions, aspects, circumstances or situational occurrences are captured within a single observation. For example, if you were observing/recording weather conditions, each observation could be composed of co-occurring aspects, such as the current humidity reading, the sunshine level, or whether it is raining or cloudy. These co-occurring aspects might well be thought of as defining particular situations.

These co-occurring aspects that make up particular situations (combinations of co-occurring aspects) can be used to predict/infer the probability of one particular aspect occurring in the future, given that we know how often combinations of their co-occurring aspects occur at the same time. The more frequently they occur, the more probable the prediction. The process used to work this out is called marginalising over the priors, and the process requires computing a Conditional Probability Table (CPT) for the particular target aspect you are looking to predict. In doing so, this table derives general, relative probabilities for your target aspect based on the various combinations of other aspects that were also present when your target aspect was present.

We might then try to model situations as the association of captured momentary aspects (as events) and identify recurring aspects, thereby revealing/discovering situations as they occur more frequently.

Details: Category: Blog; By Stuart Mathews; 29.Jul; 01 April 2025; Last Updated: 06 September 2025; Hits: 1239

Questions

How can we create more realistic and adaptive non-player characters (NPCs) in video games?
How can the pursuit for autonomous, self-aware NPCs help to model, develop and test new and novel real-time learning methods and techniques?
Can modelling intelligent virtual entities in real-time computer games help to inform models for self-awareness in broader fields such as robotics or more general software applications?

Introduction

NPCs in games often follow predefined scripts for behaviour, but research is working towards AI systems that can adapt to player behaviour and react in realistic and contextually appropriate ways to make the characters feel more dynamic, responsive, and lifelike.

For example, research in Behaviour-driven AI considers how to model NPC behaviour using techniques such as Finite State Machines (FSMs), Behavior trees, and the use of Machine learning models and Goal-oriented action planning (GOAP) to model adaptive NPC behaviour.

Research in Environmental Awareness focuses on techniques that create NPCs that can react dynamically to the changing world, such as modelling sense and perception, while pathfinding and navigation techniques allow for reactive movement and adaptive routing choices.

Research in Emotional AI aims to enhance immersion and realism by modelling emotional responses through the development of personality frameworks and modelling relationship dynamics.

Research in Memory and Learning focuses on persistent memory in NPCs and learning from players’ past behaviours, while Advanced Combat and Strategy focuses on developing adaptive combat tactics and strategic decision-making.

Lastly, the research in Social AI aims to model NPC group dynamics, social interactions, conflict and cooperation.

These areas of active research all aim to improve the adaptability and realism of NPCs in games and help to situate the proposed research.

Index

Research Proposal

The proposed research is to undertake the creation of a virtual reality simulation framework for the research and development of NPC behaviour based primarily on modelling and developing the real-time decision-making faculties (or the ‘brain’) of these virtual entities based on their momentary perception of their circumstances using virtual sensory observation.

Specifically, the simulation framework will utilise computer game technology to initially develop real-time sensory virtual environments (worlds, realities) for developing adaptive NPC behaviour based on perception, observation, and with a special focus on the exploration and characterisation of the ‘unknown’ within these dynamic sensory environments.

In these environments, NPCs will be able to ‘sense’ and develop ‘experience’ through the interaction and sensation of other ‘objects’ (including other NPCs), thereby creating a real-time situational environment in which experiments can be undertaken to create and test new models (or simulate hybrid models) to contribute towards adaptive and real-time AI and NPC behavioural research.

Sensory virtual environments

The premise of this research is that through the simulation of an autonomous entity or entities (NPCs) that have been afforded the facilities to ‘experience’ and ‘perceive’ sensory information, and an environment that provides sensory information, that encountered situational experiences in the first instance can be identified and studied, codified and measured, and therefore provide situational data for analysis that incorporates learning, adaptive behavior and detection/classification of situations.

Observations might include the perception of unknown entities such as other characters, objects, heat, danger, etc., within the virtual reality, and equally the sensation of multiple aspects of an observation that might define and identify the makeup of an ongoing situation.

In this way, objects and the environment that the agent interacts with will need to possess characteristics and properties that can be sensed.

Modelling circumstantial perception in game characters based on the assimilation of surrounding sensory experience, specifically through empirical observation, is a key aspect of the simulations proposed in this research.

An important aspect of such a simulation is an environment that can be designed to emit dynamic circumstantial sensory information as part of the virtual reality, including variable, constantly evolving artificial entities or game objects like NPCs, behaviours and other environmental variations that the learning agents can perceive.

A possible model, for example, might be by sensing stimuli and response sequences that are encountered within the sensory environment, and observing their coordination and organisation patterns. This might help build a hypothetical model that can describe what is initially unknown, but which is otherwise empirically ‘experienced’.

Research by Lewis et al. (2015), for example, produced a general architectural blueprint (Figure 1) for designing self-aware systems that use sensory input to develop, refine and enhance levels of awareness in software to “…achieve sophisticated autonomous behaviour by adapting themselves at runtime and through learning processes that enable ongoing self-change”. Recently, this architecture has been specialised by Alemaw et al. (2025) to further model the interactions between autonomous agents.

Reference architecture for self-aware and self-expressive computing systems (Lewis et al., 2015) — Figure 1: Reference architecture for self-aware and self-expressive computing systems (Lewis *et al.*, 2015)

Examples of possible sensory virtual environments (in which NPCs would be situated) are shown in Figure 2.

Figure 2: Potential 2D and 3D sensory simulation environments (Mathews, 2022)

Outcomes

Aim

Using computer game technologies to simulate real-time experiences of autonomous and self-aware NPCs for developing new and novel real-time models for characterising the ‘experienced’ unknown, improving contextual adaptation, behaviour and situation detection in NPCs through exploration and learning within dynamic and sensory-based virtual environments.

Outcomes

Simulation framework for modelling approaches for creating NPC behaviour based on sensory observation.
1. Methods and techniques for representing and delivering sensory stimuli in virtual environments to NPCs (observational protocol)
2. Methods and techniques for implanting self-awareness in NPCs using sensory observation from sensory-based virtual worlds.
Models for simulating sensory-based online learning approaches, such as:
1. Real-time situation detection using temporal networks
2. Real-time situation changes/boundaries using scene graphs
3. Dynamic behaviour generation using dynamic behaviour trees
Research findings and observations of simulated approaches

Design Approach

Game engines such as Crytek’s Target Tracks Perception System (TTPS) (Rabin, 2014a) and Epic’s Unreal’s AI Perception System (‘AI Perception’, no date) are two examples of perception systems that base their architecture on the notion of a stimulus being elicited by a source object in the world, and perceived by an agent. This essentially provides the agent with the ability to subscribe to sense events and have sensations delivered to them when they are emitted by event sources in the world.

The perception system used in the game, Mark of the Ninja by Klei Entertainment, is described in (Rabin, 2014b) as a “…sense detection architecture”, whereby certain “interest” objects (interest sources) can be placed in the world, and can be sensed by agents.

These so-called ‘interests’ themselves determine if an agent (or agents) can be perceived by them at any moment in time (they are updated in real-time in the game loop), and could be good candidates in this research for modelling the implementation of virtual emitters and sensors within the virtual environment for stimuli-response data collection purposes.

For example, figure 3 illustrates a character, labelled as ‘Player’, who is registered as such an interest source that provides sensation events such as auditory stimuli to surrounding interested parties when they are within an established sensation proximity radius.

Figure 3: Player as interest source producing audible stimuli (Rabin, 2014a)

This research proposes to model sense detection using similar mechanisms to simulate the production of situational stimuli within the simulation environment.

The creation and propagation of sensory stimuli within the simulated environment(s) would likely be handled by the event management system within the simulation framework library, which is currently under development (along with other framework components that aim to simulate causality and the formation of expectations from sensory input), and would continue to evolve with this research.

Methods and Analysis

The general research approach is outlined as a series of phases, as illustrated in Figure 4.

The first phase is concerned with modelling and simulating virtual environments to establish a situational learning environment for NPCs.

The next stage creates self-awareness models in NPCs that are capable of experiencing and sensing their virtual environments.

The penultimate phase implements autonomous exploration of NPCs within the virtual environments to dynamically collect real-time observations about the environment to inform the models used for online learning.

The final stage is to use the learnt models to produce adaptive and contextual behaviour and decision-making in the NPCs.

Tools and Techniques

A foundational cornerstone of this research is the use of computer game technology and AI to develop real-time simulations of NPCs and their environments to model learning, adaptability, and the implementation of self-awareness and creation of contextual behaviour.

This requires translating abstractions that capture methods for implementing these abilities into software models which can be harnessed and utilized by the reasoning repertoire of any such agent to assist in carrying out sophisticated tasks that are normally otherwise expected of living things (remembering, deliberating, weighing options, feeling and deciding based on their own motivations, circumstances and requirements). This means a large portion of this research requires advanced programming for modelling ideas and implementing them as software models within computer gaming simulations.

For example, simulations include implementing aspects of 2D/3D graphics, implementing and developing scripting and AI systems and engineering the design of the software models that model ideas such as the sensory and event management systems, the autonomy and exploratory behaviour of NPCs and their reasoning capabilities, etc. Other techniques often used in game development include the flexibility of data-driven development, which is capable of dynamically loading, storing and transferring models, behaviours, and other functionality. The formation of these models, and the techniques and approaches for implementing them, are ultimately what will form the contribution of this research.

Technologies to simulate the experiments are expected to include C++, Lua, OpenGL/DirectX, and the use of game AI systems that can be used to implement autonomous exploration and store and trigger behaviour such as FSMs, BTS and more flexible learning system,s which include models for developing persistent memory and self-awareness, including learning systems such as machine learning.

An important aspect of implementing a framework for a system is a systematic approach to its design and engineering. A strategic part of this research is to adopt an approach to engineering software models that are reusable and adaptable to change.

To achieve this, this research will develop a software library (or libraries) alongside the main agent simulations, and this will be integrated into these simulations. In this way, much of the programming and model implementation will be done in an abstract way such that the code, including the AI models, can be reused throughout further simulations. This will encourage the composition of more creative variations from the engineered models.

By creating a library (or libraries) that embody the different learning approaches and strategies used in the research, they can then be used to highlight and externalise learnt approaches, as well as be used to produce isolated demonstrative examples. This will also help with an iterative progression of general understanding by incorporating them into newly developed models.

This component-based strategy is also likely to improve the rate at which productivity can proceed, particularly in making future sensory-based systems, as the artefacts and models can be more easily used by other participants who wish to incorporate the ideas into independent projects or participate in this research. This will enable them to be more reusable in more situations, and also can drive creativity through making experimental variation easier.

This will also allow the ability to abstract and externalise the complexity of systems and sub-components that are developed, such that they can be practically studied independently.

As the research progresses, the library could develop into discernible areas which can be tested and reasoned about separately. These component-based libraries will also allow for the validation of software models that can be verified as correct through unit testing. This will not only encourage change and refactoring, but also likely help document the components while showing how they are used. It's strongly believed that verifiable software models that are testable make exploration and usage of those models more robust than if they were integrated tightly into the specific simulations.

Advancing knowledge

In many ways, a simulated virtual entity such as an NPC and their advancements in dealing with circumstantial information can be considered as an abstraction for what could be utilised in other fields, such as robotics, intelligent assistants or even self-driving cars. This makes researching and advancing these capabilities pivotal in developing more intelligent and self-aware applications in the future.

This research aims to explore how using computer game technologies can be used to simulate and develop new and novel real-time models for characterising the ‘experienced’ unknown, improving contextual adaptation, behaviour and situation detection in real-time situations.

Beyond the simulation and development of models for self-awareness and real-time learning in NPCs within sensory environments, some specific approaches might be explored in more detail during this research.

Temporal Networks

Temporal networks are graph representations of relationships that occur over time between entities, often represented as ‘nodes’, with edges between them representing a relationship (Figure 5).

Temporal vs Static Graphs (Wu et al., 2014) — Figure 5: Temporal vs Static Graphs (Wu *et al.*, 2014)

An approach that is compelling is to stream the generation of temporal networks from the real-time experiences of causality based on stimuli and responses within virtual worlds.

This offers a means to study how these networks evolve over time, including how similar or dissimilar they are in certain situations, and what they can tell us about the nature of those interactions and the effect that environmental stimuli have.

Work by Masuda and Holme (2019) investigated how general system states could be detected when representing states as sequences of changing directed graphs over time (temporal networks), as shown in Figure 6.

This insight provides the opportunity to model the formation of real-time situations as the formation of temporal networks.

These could be used to model the relationships that define and identify situations in real time. This also provides the potential to model situational changes in response to experienced stimuli within the virtual environment, as evolving temporal networks.

Figure 6: Representing system states over time (State Dynamics) (Masuda and Holme, 2019)

In this proposed research, we might not only be concerned with detecting system states (situations) from sensory events, but also allowing agents to identify the events that cause those states to happen and also predicting the behaviour/situation that is likely to occur as a result of the event.

Research by (Wu et al., 2014) shows that ‘minimum temporal paths’ statistics (earliest arrival path, latest departure path, fastest path, shortest path) can be used to analyze temporal networks, and this has been used in related research by (Ceria and Wang, 2023), (Zhan et al., 2021), to show that it is possible to characterize and generalize the temporal and spatial properties of evolving networks to distinguish differences or similarities between them over time.

The implication of this is that such mechanisms, and improved tooling for calculating temporal metrics such as those provided by (Oettershagen and Mutzel, 2022), could be used by NPCs to compare properties of encountered situations to detect similar situations in real-time within sensory environments.

To represent sensory and environmental changes over time as temporal networks, this research proposes to model the formation of such temporal networks in real-time from the sensory events that occur in response to the experiences the agent(s) encounter within the sensory world. Such time-based graph sequences could be used to form signatures to detect situations and derive richer causality relations within the world.

An important consideration is to be able to accurately distinguish discrete states and their contextual characteristics in real-time amidst complex, noisy and high-event rate virtual reality simulations. While (Masuda and Holme, 2019) uses graph distance, and (Zhan et al., 2021) the fastest arrival distance (FAD) to identify and generalize network graphs into discrete states, this might be too coarse a generalization, and might require more features of the network/environment to be considered in representing situational state, particularly in complex environments that can produce large volumes of data such as those seen in real-time environments (like games).

Another interesting avenue of simulation is using real-time (online) sensory experience obtained within the virtual environment to train and test NPC behavioural and self-awareness models using machine learning.

For example, techniques such as Bayesian networks, Hidden Markov models and SVM could be used to incrementally describe sensory events that are experienced by the agent.

In conjunction with the above methods, research by IJsselmuiden et al. (2014) showed how the use of Situation Graph Trees (SGTs) can be used for modelling and detecting dynamic situations. These are in many ways like behaviour trees (BTs) that detect situations by traversing a graph of conditions that match situation signatures.

While SGTs, like BTs, are often predefined constructions used to respectively detect well-known situations (see figure 7) or trigger well-known behaviour (in the case of BTs), an interesting pursuit is the dynamic creation of SGTs and BTs, based on the real-time experiences of agents encountered within virtual environments.

Situation graph tree (IJsselmuiden et al., 2014) used to detect group behavior — Figure 7: Situation graph tree (IJsselmuiden *et al.*, 2014) used to detect group behavior

Observing and modelling behaviour

Typically, behaviours in games are not generated from experience and instead are pre-defined behavioural logic or scripts that define how the agent should behave in predefined circumstances. This would be pre-defined actions/behaviour to predefined circumstances, meaning behaviour is not created, but that pre-existing behaviour is triggered mostly using finite state machines (FSMs) and BTs.

Through this research, there is scope for an experiential learning model to dynamically generate these behavioural scripts in real-time as behaviour trees (see figure ).

For example, an approach developed by Robertson and Watson (2015) was to dynamically generate behaviour trees (BTs) based on the supply of recurring textual action sequences. Their research showed that a resultant “… behaviour tree was able to represent and summarise a large amount of information from the expert behaviour examples”. These examples could instead come from the actions of NPCs and other dynamic objects that ‘live’ and behave within real-time virtual environments.

This would allow agents to dynamically create rules for observed behaviour based on their experience and observations of situations and the behaviour of others. The agent could use this as part of its behavioural knowledge base and carry out those behaviours in the future. Equally, these learnt behaviours (trees) might be transferable to other agents for them to use when situations call for it, in a similar way that transfer learning is used in machine learning.

Scene Graphs

An approach to modelling situation detection that is also compelling is to compose sensory virtual world objects into a hierarchical dependency scene graph and to model sensory events as influences upon those objects.

This may provide opportunities to identify the effects of causality and the influence it has on multiple related objects, such as the propagation of effects to their dependencies.

For example, pushing a box with a pen in it will also move the pen relative to the box. In this way, when a stimulus affects an object, the resulting change/response can be observed in its dependencies, allowing for a better generalisation of the effect of the stimulus.

A hierarchical representation could serve as a means to aggregate, group and generalise the effects of multiple changes in the environment.

For example, this could help analyse situational consistency and help derive situational boundaries by forming level of detail change calculations, where a lower level of detail might ignore smaller, more frequent or consistent changes in the scene graph and focus on bigger changes in order to generalise a situation boundary at that level.

Figure 8 illustrates an example of organising virtual objects into a scene graph.

Figure 8: Organizing objects into a scene (Unknown, 2013)

This approach should also make it possible to identify which objects should be actively observed for changes (as a result of stimuli in the environment) without checking every object in the scene, e.g., pushing the box means its dependents might not need to be actively checked, as the majority of their changes are likely derived from the box’s changes, therefore saving computation time. While this is useful, there is likely still a need to be able to determine when objects that are still important, i.e make up a situation, but have not actually changed in response to the stimuli (universals).

The basic premise is that the scene graph is checked for differences caused by stimuli, and these changes can be used to detect situational boundaries. An illustration of this approach is outlined in Figure 9.

Figure 9: Detecting changes in the environment due to environmental stimuli using a scene graph

About me

I have been actively engaged in researching computer game technologies while building out a game engine that incorporates many of my interests in game development, including graphics, multiplayer networking and behavioural AI.

Currently, I work full-time (remotely) in Uxbridge as a software developer for Citrix Systems, creating a variety of hybrid systems (on-premise and cloud-based). Prior to this, I worked in a similar capacity for VMWare. I have been developing systems for software companies for the last 15 years, most of which while studying.

I enjoy the process of learning, and in that time, I have completed the following studies:

MSc Software Engineering
MSc Computer Games Technology
MSc Cyber Security
BSc (Hons) Open

My last three theses have revolved around researching computer games, namely:

Evaluating the design requirements for a secure, low-latency, multiplayer network protocol.
Applying and evaluating functional programming paradigms and techniques in developing games.
Applying pattern-oriented designs in complex software (Games)

I’d like to continue learning by pursuing a formal research degree, focusing on what I already do in my spare time, which is studying computer game development.

This research project is intended to be self-funded, part-time and based primarily in Uxbridge.

Stuart Mathews

References

‘AI Perception’ (no date). Available at: https://docs.unrealengine.com/5.1/en-US/ai-perception-in-unreal-engine/ (Accessed: 5 January 2023).

Alemaw, A. S. et al. (2025) ‘Modeling Interactions between Autonomous Agents in a Multi-Agent Self-Awareness Architecture’, IEEE Transactions on Multimedia, pp. 1–16. doi: 10.1109/TMM.2025.3543110.

Ceria, A. and Wang, H. (2023) ‘Temporal-topological properties of higher-order evolving networks’, Scientific Reports, 13(1), p. 5885. doi: 10.1038/s41598-023-32253-9.

IJsselmuiden, J. et al. (2014) ‘Automatic understanding of group behavior using fuzzy temporal logic’, Journal of Ambient Intelligence and Smart Environments, 6(6), pp. 623–649. doi: 10.3233/AIS-140290.

Lewis, P. R. et al. (2015) ‘Architectural Aspects of Self-Aware and Self-Expressive Computing Systems: From Psychology to Engineering’, Computer, 48(8), pp. 62–70. doi: 10.1109/MC.2015.235.

Masuda, N. and Holme, P. (2019) ‘Detecting sequences of system states in temporal networks’, Scientific Reports, 9(1), p. 795. doi: 10.1038/s41598-018-37534-2.

Mathews, S. (2022) ‘Cppgamelib’. Available at: https://github.com/stumathews/cppgamelib (Accessed: 21 January 2023).

Oettershagen, L. and Mutzel, P. (2022) ‘TGLib: An Open-Source Library for Temporal Graph Analysis’. arXiv. doi: 10.48550/arXiv.2209.12587.

Rabin, S. (2014a) ‘Crytek’s Target Tracks Perception System’, in Game AI Pro. A K Peters/CRC Press, pp. 432–441. doi: 10.1201/b16725-37.

Rabin, S. (2014b) ‘How to Catch a Ninja: NPC Awareness in a 2D Stealth Platformer’, in Game AI Pro. A K Peters/CRC Press, pp. 442–451. doi: 10.1201/b16725-38.

Robertson, G. and Watson, I. (2015) ‘Building behavior trees from observations in real-time strategy games’, in. IEEE, pp. 1–7. doi: 10.1109/INISTA.2015.7276774.

Unknown (2013) ‘Game Engine Design Blog: Scene Graphs’, Game Engine Design Blog. Available at: https://aldwinchogameengine.blogspot.com/2013/11/scene-graphs.html (Accessed: 16 March 2025).

Wu, H. et al. (2014) ‘Path problems in temporal graphs’, Proceedings of the VLDB Endowment, 7(9), pp. 721–732. doi: 10.14778/2732939.2732945.

Zhan, X.-X. et al. (2021) ‘Measuring and utilizing temporal network dissimilarity’, arXiv.org. Available at: https://arxiv.org/abs/2111.01334v1 (Accessed: 1 January 2024).

Projects

Login

Twitter

A Systematic Research Review Process

Situation detection using Bayesian networks

Research Proposal

Questions

Introduction

Index

Research Proposal

Sensory virtual environments

Outcomes

Aim

Outcomes

Design Approach

Methods and Analysis

Tools and Techniques

Advancing knowledge

Temporal Networks

Observing and modelling behaviour

Scene Graphs

About me

References

More Articles …