Detecting a shortcut macro call in a field, and preserving it for other operations on the field contents

Hey all!

I have fields in my TW that can contain macro shortcuts (<<macroname parameter>>). Those also contain elements I’m trying to style in a specific way in order to prettify the rendering of the field value wherever relevant. In order to style the relevant elements, I have to split[ ] on spaces, which will split the macro shortcuts as well, and even recomposing the field afterwards would render the macro shortcut calls as plain text:

image
Ex. of a rendering without doing anything specific to the macro shortcuts; they are recomposed correctly, but not computed, while the desired formatting is applied to the “1s” element of the field.

I figured I’d use conditionals to detect if a macro shortcut exists in the field value, then manipulate it so it’s preserved aside (replacing the space character with “?” in the example below), before doing the rest of the formatting that I want on the field (then replacing back the “?” with a space character). The below procedure (which is fed the value of a field to be parsed for formatting) is what I came up with:

\procedure timeparse2(value)
<% if [<value>regexp[<<\w+\s.+>>$]] %>
	<$list filter=`[search:$(value)$:regexp[<<\w+\s.+>>$]]` variable="macroshortcut">
<!--                                        ^ I'm guessing this cannot work because the angled brackets would break the list widget? -->
(<<macroshortcut>>)
		<$let tempvalue={{{ [<macroshortcut>search-replace[ ],[?]] }}}>(<<tempvalue>>)
			<$list filter="[<tempvalue>split[ ]]" variable="seg" join=" ">
				<% if [<seg>regexp[\d+s$]] [<seg>regexp[\d+m$]] [<seg>regexp[\d+h$]] [<seg>regexp[\d+d$]] %>
					<$macrocall $name="time" value=<<seg>>/>
				<% elseif [<seg>regexp[\d+s,$]] [<seg>regexp[\d+m,$]] [<seg>regexp[\d+h,$]] [<seg>regexp[\d+d,$]] %>
					<$let segm={{{ [<seg>split[,]first[]] }}}>
					<$macrocall $name="time" value=<<segm>>/>, </$let>
				<%elseif [<seg>regexp[<<\w+.+>>$]] %>
					{{{ [<seg>search-replace[?],[ ]] }}}
				<% else %>
					<<seg>> 
				<% endif %>
			</$list>
		</$let>
	</$list>
<% else %>
	<$list filter="[<value>split[ ]]" variable="seg" join=" ">
		<% if [<seg>regexp[\d+s$]] [<seg>regexp[\d+m$]] [<seg>regexp[\d+h$]] [<seg>regexp[\d+d$]] %>
			<$macrocall $name="time" value=<<seg>>/>
		<% elseif [<seg>regexp[\d+s,$]] [<seg>regexp[\d+m,$]] [<seg>regexp[\d+h,$]] [<seg>regexp[\d+d,$]] %>
			<$let segm={{{ [<seg>split[,]first[]] }}}>
			<$macrocall $name="time" value=<<segm>>/>, </$let>
		<% else %>
			<<seg>> 
		<% endif %>
	</$list>
<% endif %>
\end

Unfortunately, I’m guessing that using directly regexp[<<\w+\s.+>>$] cannot work because of the angled brackets in the filter expression (for the same reason using any square brackets in a regexp expression would also break the filter).

Does anyone know if there would be a simple way to handle such cases? I tried using functions like:

\function macro-regexp() [<<\w+\s.+>>$]

To then write the filter with:

<$list filter=`[search:$(value)$:regexp<macro-regexp>]` variable="macroshortcut">

But it didn’t work (actually breaking a whole lot of stuff). Maybe backtick substitutions would? Or is there another deeper factor that makes the use case tougher to handle?

Instead of using regexp to find the embedded macro syntax, perhaps you can avoid the whole problem by just pre-processing the input string using $wikify, which will automagically convert all embedded macros into their plain text results, and then continue on with the “apply special formatting to numbers” logic, like this:

\procedure timeparse2(value)
<$wikify name=value text=<<value>>>
<$list filter="[<value>split[ ]]" variable="seg" join=" ">
	<% if [<seg>regexp[\d+s$]] [<seg>regexp[\d+m$]] [<seg>regexp[\d+h$]] [<seg>regexp[\d+d$]] %>
		<$macrocall $name="time" value=<<seg>>/>
	<% elseif [<seg>regexp[\d+s,$]] [<seg>regexp[\d+m,$]] [<seg>regexp[\d+h,$]] [<seg>regexp[\d+d,$]] %>
		<$let segm={{{ [<seg>split[,]first[]] }}}>
		<$macrocall $name="time" value=<<segm>>/>, </$let>
	<% else %>
		<<seg>> 
	<% endif %>
</$list>
\end

Let me know how it goes…

-e

Very interesting idea!

It sort of works; however, since the macros in the field also use numbers, they seem to get interpreted as a valid element to be formatted as a time instead of retaining the macro shortcut syntax.

image (I added the 1m 1s, in the field to test for correct detection)

For example, the correct formatting should be: image - but when <<cylinderytom 1>> gets wikified, it outputs a 1m (1yd) (that in this case means 1 meter, not 1 minute); so it gets picked up by the logic of the procedure.

I guess I could solve this by changing the format of the times and the regexp to work with 1sec, 1min, etc. rather than just single letters.

Edit:
So I changed the procedure to this:

\procedure timeparse2(value)
<$wikify name=value text=<<value>>>
<$list filter="[<value>split[ ]]" variable="seg" join=" ">
	<% if [<seg>regexp[^\d+(sec)$]] [<seg>regexp[^\d+(min)$]] [<seg>regexp[^\d+(hour)$]] [<seg>regexp[^\d+(day)$]] %>
		<$macrocall $name="time" value=<<seg>>/>
	<% elseif [<seg>regexp[^\d+(sec),$]] [<seg>regexp[^\d+(min),$]] [<seg>regexp[^\d+(hour),$]] [<seg>regexp[^\d+(day),$]] %>
		<$let segm={{{ [<seg>split[,]first[]] }}}>
		<$macrocall $name="time" value=<<segm>>/>, </$let>
	<% else %>
		<<seg>> 
	<% endif %>
</$list>
\end

But unfortunately, it doesn’t work either:
image

edit 2: and if I rename the $wikify variable and use it like this

\procedure timeparse2(value)
<$wikify name=wikifiedvalue text=<<value>>>
<$list filter="[<wikifiedvalue>split[ ]]" variable="seg" join=" ">
	<% if [<seg>regexp[^\d+(sec)$]] [<seg>regexp[^\d+(min)$]] [<seg>regexp[^\d+(hour)$]] [<seg>regexp[^\d+(day)$]] %>
		<$macrocall $name="time" value=<<seg>>/>
	<% elseif [<seg>regexp[^\d+(sec),$]] [<seg>regexp[^\d+(min),$]] [<seg>regexp[^\d+(hour),$]] [<seg>regexp[^\d+(day),$]] %>
		<$let segm={{{ [<seg>split[,]first[]] }}}>
		<$macrocall $name="time" value=<<segm>>/>, </$let>
	<% else %>
		<<seg>> 
	<% endif %>
</$list>
\end

it doesn’t render the styling from macro shortcuts either:

image

Edit 3: if I close the wikify widget like this:

\procedure timeparse2(value)
<$wikify name=value text=<<value>>>
<$list filter="[<value>split[ ]]" variable="seg" join=" ">
	<% if [<seg>regexp[^\d+(sec)$]] [<seg>regexp[^\d+(min)$]] [<seg>regexp[^\d+(hour)$]] [<seg>regexp[^\d+(day)$]] %>
		<$macrocall $name="time" value=<<seg>>/>
	<% elseif [<seg>regexp[^\d+(sec),$]] [<seg>regexp[^\d+(min),$]] [<seg>regexp[^\d+(hour),$]] [<seg>regexp[^\d+(day),$]] %>
		<$let segm={{{ [<seg>split[,]first[]] }}}>
		<$macrocall $name="time" value=<<segm>>/>, </$let>
	<% else %>
		<<seg>> 
	<% endif %>
</$list></$wikify>
\end

It fails as well.

I haven’t tested your regexp, but if it works, you could build the regex string first…
by splitting the string, you can get around the double angle brackets trying to resolve.

\procedure timeparse2(value)
<$let
  macroshortcutRegexp={{{  [[<<]] [[\w+\s.+>>$]] +[join[]] }}}
>

<% if [<value>regexp<macroshortcutRegexp>] %>
	<$list filter=`[search:$(value)$:regexp<macroshortcutRegexp>]` variable="macroshortcut">

<!-- rest of your code -->

Thanks for the suggestion! Doesn’t seem to work, though :confused:

Switching the markers from s, m, h, d to sec, min, hour, day, and wikifying the whole thing as per @EricShulman’s suggestion, I get:


So the initial macro-finding regexp seems to work, but the whole procedure doesn’t preserve the function of the detected macros. For reference, the above result comes from this version:

\procedure timeparse2(value)
<$wikify name=value text=<<value>>><$let
  macroshortcutRegexp={{{  [[<<]] [[\w+\s.+>>$]] +[join[]] }}}
  macro2shortcutRegexp={{{ [[<<]] [[\w+.+>>$]] +[join[]] }}}
>
<% if [<value>regexp<macroshortcutRegexp>] %>
	<$list filter=`[search:$(value)$:regexp<macroshortcutRegexp>]` variable="macroshortcut">
		<$let tempvalue={{{ [<macroshortcut>search-replace[ ],[?]] }}}>
			<$list filter="[<tempvalue>split[ ]]" variable="seg" join=" ">
				<% if [<seg>regexp[\d+(sec)$]] [<seg>regexp[\d+(min)$]] [<seg>regexp[\d+(hour)$]] [<seg>regexp[\d+(day)$]] %>
					<$macrocall $name="time" value=<<seg>>/>
				<% elseif [<seg>regexp[\d+(sec,)$]] [<seg>regexp[\d+(min,)$]] [<seg>regexp[\d+(hour,)$]] [<seg>regexp[\d+(day,)$]] %>
					<$let segm={{{ [<seg>split[,]first[]] }}}>
					<$macrocall $name="time" value=<<segm>>/>, </$let>
				<%elseif [<seg>regexp<macro2shortcutRegexp>] %>
					{{{ [<seg>search-replace[?],[ ]] }}}
				<% else %>
					<<seg>> 
				<% endif %>
			</$list>
		</$let>
	</$list>
<% else %>
	<$list filter="[<value>split[ ]]" variable="seg" join=" ">
		<% if [<seg>regexp[\d+(sec)$]] [<seg>regexp[\d+(min)$]] [<seg>regexp[\d+(hour)$]] [<seg>regexp[\d+(day)$]] %>
			<$macrocall $name="time" value=<<seg>>/>
		<% elseif [<seg>regexp[\d+(sec,)$]] [<seg>regexp[\d+(min,)$]] [<seg>regexp[\d+(hour,)$]] [<seg>regexp[\d+(day,)$]] %>
			<$let segm={{{ [<seg>split[,]first[]] }}}>
			<$macrocall $name="time" value=<<segm>>/>, </$let>
		<% else %>
			<<seg>> 
		<% endif %>
	</$list>
<% endif %></$let><$wikify>
\end

try using (<<__value__>>) instead of (<<macroshortcut>>) that way you aren’t wikifying it and it should call the macro instead of rendering it as text.

I actually just realized I had left some debugging in the procedure version above. I have edited it. There should not have been (<<macroshortcut>>) nor (<<tempvalue>>) in the list widget.

So you mean changing this line:

<$let tempvalue={{{ [<macroshortcut>search-replace[ ],[?]] }}}>(<<tempvalue>>)

to:

<$let tempvalue={{{ [<__value__>search-replace[ ],[?]] }}}>(<<tempvalue>>)

?

Edit: that’s what I get with that substitution (i.e. same): image

I get the same with the backtick version of the substitution:

<$let tempvalue=`{{{ [$(value)$search-replace[ ],[?]] }}}`>

(deleted the original post because I forgot to add something, here’s my updated reply:)

Personally, I wouldn’t try to parse WikiText syntax with regular expressions. This can lead to all kinds of problems, since plain regular expressions are incapable of correctly parsing recursive syntactic structures (paired and balanced block or string delimiters). JavaScript regular expressions support backreferences, so theoretically it seems possible, but still feels way to overengineered for a relatively simple formatting task like yours.

Here’s another idea: What if all of your text fields always wrap time values by a time procedure call, e.g.

<<time 1s>>, <<cylinderytom 1>> radius. Outwardmove <<cylinderytom 1>>/second of concentration

The time procedure could additionally allow an optional parameter like styled:"no" or styled:"yes". The procedure would then conditionally return either a styled or unstyled version of its first argument.

Inside the procedure, you could also rely on an external variable (say, styleTimes) that acts as a default value for when the styled parameter is missing. Any style arguments would override this variable when they are present. To control the default value, simply define a variable named styleTimes where you need it:

<$let styleTimes="yes">{{someTiddler!!fieldWithContentLikeShownAbove}}</$let>

That way, you could have almost complete control over the styling.

You can edit your own posts for quite some time. So deleting should not be needed, if you only want to add some new content or screenshots.

Good to know. Thanks.

Hey Roland!

Unfortunately, that option would not work for me. Most of the fields are used for computations before display. For example, in my use case (translating GURPS TTRPG rules in TW) a time value could be reduced or increased by other factors (like the advantages or disadvantages a character could pick up, or their skill level in a given skill, etc.).

As a result, the base value is preferable to me; the computations are more important than the styling in almost all cases.

The current example debated in this very thread is an edge case; the information I’m trying to style is specific to one spell tiddler where the casting_time field had extra details not usually present with most other spell tiddlers. If the character in this case picks up that spell, but also has other traits that influence the casting_time, I need to compute that influence, and said influence only affects the time value, not the extra details. But as a general personal guideline, I’m trying to write code that can handle such exceptions or edge cases and still deliver the prime value I’m after: automation of rules-induced calculations.

to illustrate the use case:


On the above image, we are dealing with the “Shape air” spell as listed on a Character sheet. It has, among other things, a skill level the character has mastered it to, a base_cost to cast in “FP” (Fatigue points), and additional_cost depending on the extent of the effect pursued by the character and a casting_time.

Since the Character has an advantage named “Magery” at a level above 0, I evaluate the skill with the bonus that Magery gives, by computing the amount of character points (CP; 16 in this case) the character invested in this spell skill, which amounts to a total skill of their IQ value +4, which is 15. Since this skill mastery level is at least 15, it gives the character discounts on the cost of casting the spell (as shown here by the addition of “(adj. from)” ; the casting time could be reduced as well, but that happens only for a skill level above 19.

So if you read all this, you see what I mean; automating such calculations to reflect the rules would be insanely harder to do after any styling - it is much preferable to me to compute first, then style the results.

Thanks for providing some extra context.

So this is how I understand it: the relevant fields all contain potentially complex free-form text. This text is parseable only in the sense that the text loosely adheres to some syntactic conventions and there’s only a limited subset of WikiText syntax allowed in there (e.g macro calls to specific macros). You want to find unstyled scalar values in the text in-between those macro calls (e.g. time duration values) and then style those values for display.

Even with these self-imposed syntax limitations, trying to recognize macro calls through regular expressions is not going to end well in some cases. I came up with the following monstrosity:

\procedure macro-shorthand-regexp()
<<(?<macroName>[^\s\(\>]+)(?:\s+(?<macroParams>(?:(?:(?:[-a-zA-Z0-9_]+\s*:\s*)?(?:\"\"\"(?:(?!\"\"\").)*\"\"\"|(?:\"[^\"]*\")|(?:\'[^\']*\')|(?:\[\[[^\]]*\]\])|(?:(?:(?!\>\>)[^\"\'\s])+)))\s*)*))?\s*>>
\end

That regexp at least respects the various quoted forms of macro arguments, but you’ll have to admit, it is not a pretty sight. And not exhaustively tested. Using it can and probably will recognize false positives, for example in the following input:

`<<not_a_macro_call_because_of_backticks>>`

So my initial idea was to ditch the parsing altogether and have procedures like time return plain values (like 1s for one second) by default. The problem being, as you pointed out, that the results from these procedures can’t be used to compute anything without wikifying them to temporary variables, and that would be just too much of a hassle.

A possible solution could be to use functions instead of procedures. Now math expressions would be easy, but emitting styled and rendered WikiText would not, requiring a similar wikification workaround.

So I spent some time trying to rewrite your example to be simpler and easier to follow through and maintain, but I’m still not sure whether that’s the optimal way. I still believe that tiddler fields should not violate the first normal form (from database design) whenever possible (the only composite field types that I use are title lists and JSON-formatted values, since TW makes it easier to parse JSON.)

Anyway, here’s what I ended up with:

\procedure macro-shorthand-regexp-start()
^\<\<
\end

\procedure macro-shorthand-regexp-end()
\>\>$
\end

\procedure macro-shorthand-regexp()
<<(?<macroName>[^\s\(\>]+)(?:\s+(?<macroParams>(?:(?:(?:[-a-zA-Z0-9_]+\s*:\s*)?(?:\"\"\"(?:(?!\"\"\").)*\"\"\"|(?:\"[^\"]*\")|(?:\'[^\']*\')|(?:\[\[[^\]]*\]\])|(?:(?:(?!\>\>)[^\"\'\s])+)))\s*)*))?\s*>>
\end

\function isolate-macro-calls(value)
  [<value>search-replace:g:regexp<macro-shorthand-regexp>,[###$&###]]
  +[splitregexp[###]!is[blank]]
\end

\procedure timeparse2(value)
<$list filter="[function[isolate-macro-calls],<value>]" variable="fragment">
  <% if [<fragment>regexp<macro-shorthand-regexp-start>]
      :then[<fragment>regexp<macro-shorthand-regexp-end>]
    %>
    <!-- fragment looks like a macro -->
    <<fragment>>
  <% else %>
    <!-- fragment looks like in-between text -->
    <$list filter="[<fragment>split[ ]]" variable="seg" join=" ">
      <% if [<seg>regexp[^\d+(s|m|h|d)$]] %>
        <$macrocall $name="time" value=<<seg>>/>
      <% elseif [<seg>regexp[^\d+(s|m|h|d),$]] %>
        <$let segm={{{ [<seg>split[,]first[]] }}}>
        <$macrocall $name="time" value=<<segm>>/>, </$let>
      <% else %>
        <<seg>>
      <% endif %>
    </$list>
  <% endif %>
</$list>
\end
1 Like

my last 2 cents is to add debugging for each step and use

<$codeblock code={{{  [.....] +[join[
]] }}} />
``` to see what is being returned at each step. I find it very helpful when trying to work out complex procedures.
2 Likes

Wow Roland, thanks for your efforts!

Well it might not be the optimized way, but it’s the first way that actually works:
image

Now to be honest, the prospect of incorporating the math to your approach in order to account for rules-induced discounts and whatnot is a bit much for me right now; I had all but forsaken this kind of edge case and moved the extra bits of information into a notes field to be used whenever wherever I’d need. I will probably revisit the original idea of building a resilient way to both compute and style fields that hold values and extra bits, but later :stuck_out_tongue_winking_eye:

In any case, your code and thinking are invaluable to me thank you again for investing your time in understanding the context and coming up with this working code!

1 Like

Awesome tip! thanks :slight_smile: