I have a requirement to go through two large files one has 3307339 words and the other has 3998911 words.
Against these two files I need to find and replace occurances of files, at the moment they are in two columns in a spreadsheet.
So where there is an occurence of A1 for example, replace with B1 and so on.
Now I understand that I will likely need to output these, maybe to a CSV or a Tabbed CSV file, and then likely use regular expressions to do the find and replace, like this:
What I'm wondering about, is how do I take each entry as a variable?
The two files checking against are parallel texts, so one is in English, one is in Spanish but they are sentence aligned (this shouldn't matter, its just a background).
The content I need to replace looks like this:
- Code: Select all
So for every occurance of Linux I want to make sure it is replaced with Linux (probably a bad example). intermediate should be replaced with intermedio. Etc.