On 21 oct. 2012, at 09:48, Robert A. Rosenberg wrote:
> At 08:00 +0200 on 10/21/2012, Andrew Brown wrote about Re: Batch convert to utf8:
>
>> On 20 oct. 2012, at 20:34, Rich Siegel wrote:
>>
>>> Add a Replace All action to your text factory, which searches for "charset=iso-8859-1" (or whatever specification it is that occurs in your documents; check first) and changes it to specify UTF-8.
>>
>> Tried two Replace All in one Factory
>>
>> <!doctype(.+?)>
>> <meta(.+?)>
>>
>> and got "Insufficient memory to complete this operation".
>>
>> Worked ok one at a time in Multi-File Search.
>>
>> I still don't see why error messages cannot be copied, but I suppose that all developers have a team devoted to keeping the user in his place.
>>
>> AB
>
> Here are the two statements that need to be altered:
>
> <meta http-equiv="content-type" content="text/html; charset=utf-8" />
> <?xml version="1.0" encoding="utf-8"?>
>
> DOCTYPE does not have any indication of what character set the file uses so I do not know why you were checking it.
I was taking that away because I don't need any of the HTML etc. headers. I would have deleted them all in one go, except that the files are hopelessly inconsistent and no single search could zap all the various combinations of headers and extraneous code of one sort and another.
> If you do a multi-file search but code SAVE and DO-NOT-PROMPT, you will not run into an insufficient memory error (it handles only one file at a time, as opposed to the LEAVE OPEN option).
Yes, I know: multi-file search works fine. It is the factory that falls over with more than one replace, even with save and do-not-prompt. That is a great shame, because I have multiple search-and-replaces to run and will have to find another solution.
AB
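
For anyone landing on this thread with the same problem, one possible "other solution" is a small script outside BBEdit. What follows is only a minimal sketch in Python, not something proposed in the thread. It assumes the files really are ISO-8859-1 (check first, as Rich says), that they match "*.html", and that stripping the doctype, XML declaration and meta tags is what is wanted. The names REPLACEMENTS, clean_text, convert_file and convert_tree are made up for illustration. It rewrites files in place, so run it on a copy.

```python
import re
from pathlib import Path

# Hypothetical list of (pattern, replacement) pairs, applied in order.
# The first and third mirror the two searches tried in the factory.
REPLACEMENTS = [
    (re.compile(r"<!doctype.+?>", re.IGNORECASE | re.DOTALL), ""),
    (re.compile(r"<\?xml.+?\?>", re.DOTALL), ""),
    (re.compile(r"<meta.+?>", re.IGNORECASE | re.DOTALL), ""),
]


def clean_text(text):
    """Apply every replacement in turn to one document's text."""
    for pattern, replacement in REPLACEMENTS:
        text = pattern.sub(replacement, text)
    return text


def convert_file(path, source_encoding="iso-8859-1"):
    """Decode one file from the assumed source encoding, clean it,
    and write it back in place as UTF-8."""
    text = Path(path).read_bytes().decode(source_encoding)
    Path(path).write_bytes(clean_text(text).encode("utf-8"))


def convert_tree(root, pattern="*.html"):
    """Convert every matching file under root, one file at a time."""
    for path in sorted(Path(root).rglob(pattern)):
        convert_file(path)
```

Calling convert_tree("/path/to/copy/of/site") would process the whole folder. Only one file is held in memory at a time, and further search-and-replace pairs can be appended to REPLACEMENTS.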