I vs i - Capitalization

11 views
Skip to first unread message

K in KOP

unread,
Feb 24, 2025, 4:48:54 PMFeb 24
to TextSoap
My workflow is to drag an audio file into Descript, and then cut and paste the text from Descript into TextSoap, where I then run a customized group of cleaners through it. 

Every now and then I noticed that textSoap will de-capitalize words that should be capitalized and already were in Descript. Most glaringly, it will take the pronoun "I" and make it a lowercase "i" in all instances such as:

I'm sure I left early enough
becomes
i'm sure i left early enough

How do I fix this? Thanks.

Mark Munz

unread,
Feb 25, 2025, 1:13:49 PMFeb 25
to text...@googlegroups.com
Most likely what is happening is that one of the cleaners you are calling is first converting the text to lowercase as part of its process.
For example:

Capitalize Sentences with Lowercasing
Capitalize Lines with Lowercasing

Seeing the list and order of cleaners being applied would help narrow down the specific one that might be causing the issue.

--
You received this message because you are subscribed to the Google Groups "TextSoap" group.
To unsubscribe from this group and stop receiving emails from it, send an email to textsoap+u...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/textsoap/46944bb5-03c2-4394-92cc-9d640ffce3a0n%40googlegroups.com.


--
Mark Munz
unmarked software
https://textsoap.com/

K in KOP

unread,
Feb 25, 2025, 4:30:46 PMFeb 25
to TextSoap
Thanks, Mark - Here is a screenshot of the group cleaner as well as the individual part of that group that deals with Caps. (Not sure if it's the cause, but I suspect this one since it's the only one with a Caps function).Screenshot 2025-02-25 at 4.25.13 PM.png
Screenshot 2025-02-25 at 4.25.40 PM.png

Joe

unread,
Feb 26, 2025, 3:28:35 PMFeb 26
to TextSoap
Try regex find and replace

(?<=\s)i(?=\s|\b)

Reply all
Reply to author
Forward
0 new messages