By Anonymous User
Review Details
Reviewer has chosen to be Anonymous
Overall Impression: Average
Content:
Technical Quality of the paper: Average
Originality of the paper: Yes, but limited
Adequacy of the bibliography: Yes
Presentation:
Adequacy of the abstract: Yes
Introduction: background and motivation: Limited
Organization of the paper: Satisfactory
Level of English: Satisfactory
Overall presentation: Average
Detailed Comments:
The article has been significantly improved from the previous version: the narrative is more cohesive and well articulated. The contribution is now narrower and clearer, and I found that the removal of some previous subsections has improved the scope of the article. Nonetheless, some sections (especially Sections 6 and 7) still require further improvement, as described below.
Overall, the approach is flexible and scalable, and the formulation of the new patterns through the reuse and contextualisation of previous Boxology patterns as compositional modules further demonstrates the expressiveness of the framework.
The literature cited in Section 5 when presenting the use cases also demonstrates good coverage of the proposed patterns.
Pointers for further improvement
--------------------------------
- P2, Line 23. Fine-tuning can also lead to catastrophic forgetting.
- P2. The authors present two common approaches to address hallucinations, fine-tuning and RAG, mentioning that these introduce new challenges of their own. However, only the challenges of fine-tuning are discussed in the paragraph; those of RAG (e.g. retrieval quality, or irrelevant/outdated retrieved context) are left out.
- P2. Overall, I got the impression that the limitations of LLMs and generative models in general are used as the motivation for this work, but I do not think this is necessary given the scope of the contribution. I would instead recommend providing a short overview of the different methodologies, i.e. statistical/fully data-driven, symbolic, and neuro-symbolic approaches (each with its own merits in its respective field), to contextualise this work. Such a short introduction would also serve to introduce NeSy AI and thus reach a potentially larger audience.
- P2. While I really appreciate the potential of this work, I believe the introduction still lacks a paragraph that comprehensively states the motivation of this article and the benefits it brings. Currently, the introduction jumps straight from the limitations of LLMs to the difficulty of keeping track of an evolving landscape of new models (for which a reader may imagine very different solutions) and then to the extension of Boxology. I understand this work extends previous work (which is indeed referenced), but I would appreciate a more detailed presentation of the motivation and benefits of this approach in particular.
- P5. The authors mention (in 3.1 and 3.2) that the proposed patterns can generalise to other modalities, but little information is given on how this would be achieved. Providing more insight here would strengthen these arguments. Also, what about models handling multiple modalities as input?
- Section 4, opening (P7). Re-emphasising the limitations of LLMs in this section, after this was already done in the previous parts, sounds repetitive and may also put the reader off, especially the strong claims about LLMs' inability to understand concepts such as truth and causality. I believe this section should really focus on the patterns.
- Section 4. While relevant and well linked to the literature, I still found the description of the patterns a bit disconnected from the diagrams. Adding some basic notational elements from Boxology, such as `model:LLM` or `infer:deduce` set in \texttt{}, while the patterns are presented and explained would significantly improve this connection with little effort (see the sketch below).
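  A minimal sketch of what I have in mind (the labels are illustrative; the actual ones should be taken from the figures): a sentence such as "in this pattern, the \texttt{model:LLM} component first generates candidate outputs, which are then checked by an \texttt{infer:deduce} step over the symbolic knowledge source" immediately tells the reader which boxes in the diagram are being discussed.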
- Section 5. The current structure of this section introduces one or more relevant methods for each category/subsection (such as Retrieval-Augmented Generation, RAG) and then points to the corresponding pattern. However, I found the level of detail of this elaboration somewhat non-uniform across the subsections. For example, the way the pattern covers RAG in 5.1 is clear, whereas other subsections conclude with pointers to figures or other subsections without much detail on how the methods are actually captured/mapped (a possible remedy is sketched below).
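  A concrete, low-effort way to make the mapping uniform (the wording below is only a suggestion of mine): end each subsection with an explicit mapping sentence of the form "method X instantiates the pattern in Figure Y, where component A plays the role of \texttt{model:...} and component B the role of \texttt{infer:...}", as 5.1 already does implicitly for RAG.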
- Section 6, Discussion. While the contributions of the paper are clear, I found that: (i) some arguments remain a bit vague (such as the remark on data and symbols) and need more elaboration and specificity; and (ii) an overview of the potential scenarios for reuse of this work is still missing. For example, I can see how the Boxology extensions could be reused for illustrating and explaining models and processes to a broader audience; they could lead to the identification of the "common inner workings" of models with respect to training, inference, and generation; or they could even enable a form of "model checking" for ML models. I am being deliberately speculative here, just to give an idea of what I was expecting at this stage. Overall, the current discussion section reads a bit rushed; a more elaborate analysis of the approach, covering its limitations but also what this work actually enables now that these new patterns exist, would make the whole contribution stronger.
- Section 7, Conclusions and Future Work. I strongly encourage the authors to revisit and improve this part, especially the first paragraph, which in its current version provides very little context on the research problem and the contribution made by this work.
- There are still some typos and incomplete sentences in the document. This is a non-exhaustive list:
- P5, Line 7. "The elementary patterns 2a-2d are patterns that describe to use a model" [incomplete sentence?]
- P9, Line 7, "utelizing" [typo]
- P11, Line 51 "LL-augmented KG" [typo]