VII. Limitations & Future Directions

VII. Limitations, Extensions & Future Research Directions

This section serves to:

Identify the intentional scope boundaries of this research
Describe data, model, and methodological limitations
Highlight interpretations requiring caution
Open pathways toward academic and system-level extensions

In other words: this section clarifies that the work does not claim beyond its scope, representing one step in exploring "sequential patterns in financial markets."

7.1 Conceptual Limitations

7.1.1 Pattern ≠ Predictive Rule

Although some results may show clear statistical relationships between

$$\tilde{\mathcal{S}}^a \longrightarrow y_t^a$$

this does not:

Constitute a predictive rule
Guarantee persistence into the future
May only represent structural correlation under specific conditions

Therefore, interpretation must remain within the framework of "evidence of structured relational behavior" rather than "trading signal."

7.1.2 Regime Awareness without True Causality

While macro event tokens enable regime-aware representations:

This work cannot prove causality
Not all confounders are isolated

A formal reminder:

$$P(y \mid \text{macro}, \text{micro}) \neq P(y \mid \text{do(macro)})$$

Therefore, this work constitutes observational-sequence analysis, not causal inference.

7.2 Data & Construction Limitations

7.2.1 Survivorship & Availability Bias

If the asset set excludes:

Delisted stocks
Assets with data gaps

Bias will exist in $\mathcal{A}$, potentially distorting learned patterns.

7.2.2 Event Definition Noise

The event generation functions $\Phi_{asset}, \Phi_{macro}$:

May partially reflect human definition choices
If thresholds or trigger rules change, pattern representations may change accordingly

7.2.3 Latency & Recording Gap

Some events occur in the real world before their timestamps are recorded in the data.

This creates temporal discrepancy:

$$t_i^{observed} \neq t_i^{true}$$

Which may affect sequential interpretation.

7.3 Model & Learning Limitations

7.3.1 Long-Horizon Dilution

Although transformers support long-range dependencies, when $n$ is large:

$\alpha_{ij}$ may become highly diluted, causing long-horizon patterns to be lost.

Potential solutions include:

Hierarchical encoders
Memory-augmented attention

These remain outside the current scope.

7.3.2 Representation Entanglement

The representation $u_t^a$ may conflate micro and macro influences:

$$u_t^a \approx \Phi(\text{micro}, \text{macro})$$

Making interpretive disentanglement difficult, even with per-era analysis.

7.3.3 Sparse Outcome Learning Risk

For extremely rare outcomes:

Even with weighting / focal loss
Signals may remain unstable

Therefore, this work does not draw practical conclusions from results in any single period.

7.4 External Validity & Transfer Limitations

This model is trained on market $\mathcal{A}, \mathcal{R}$ for the specific dataset, and does not claim validity for:

Markets in other countries
Other asset classes
Time periods with different market structures

Results represent context-dependent evidence, not general laws.

7.5 Ethical & Practical Boundaries

Not to be used to influence others' investment decisions
Not to be interpreted for profit expectations
This is exploratory and structural research

7.6 Future Research Directions (Extensions)

Despite limitations, this work opens several extension pathways:

(A) Hierarchical Multi-Scale Event Modeling

Adding a two-layer structure:

$$\text{intra-pattern} \;\&\; \text{inter-pattern}$$

To learn:

Short-horizon sub-patterns
Long-horizon meta-patterns

(B) Cross-Market Transfer & Domain Adaptation

Testing:

$$u_t^{(\text{market}\,1)} \longrightarrow \text{market 2}$$

To determine whether patterns are market-general or market-specific.

(C) Semi-Supervised & Representation-Only Training

Reducing direct outcome dependence through:

Masked-event prediction
Contrastive sequence learning

Then testing embeddings against outcomes subsequently.

(D) Deeper Macro–Micro Interaction Modeling

Such as:

Cross-attention between macro-stream and asset-stream
Graph-structured market context

Rather than single-layer sequence integration.

(E) Toward Causal-Structure Hypothesis Testing

More rigorous extensions such as:

Invariant risk minimization (IRM)
Quasi-natural experiment regimes

To approach causal reasoning without violating time-causality.

7.7 Summary of Research Position

In summary, this research:

Proposes a framework for event-sequence representation + macro-conditioned learning
Designs time-consistent training and evaluation processes
Establishes transparent research interpretation principles

While acknowledging limitations directly and opening space for future in-depth studies.

7.8 Connection to Final Section

The next section (Section 8) serves as Conclusion & Practical Integration Note.

Which will:

Summarize the work's key contributions
Connect systematically to how engineering teams can develop this framework further