Pages in topic:   < [1 2 3 4 5] >
tmx from Parallel corpus of Patent Translation Resource?
Thread poster: Noe Tessmann
Michael Beijer
Michael Beijer  Identity Verified
United Kingdom
Local time: 21:15
Member (2009)
Dutch to English
+ ...
PayPal "Donate" button on Wordbook.nl Mar 6, 2015

As I mentioned elsewhere, I will be creating a special download/information page on Wordbook.nl for this material, and possibly other large collections too, but for now I've stuck a rough-and-ready PayPal "Donate" button on my website. It's at the very top of the page on:

http://wordbook.nl/

screenshot of my PayPal Donate button

Michael

PS: All doughnut credit goes to Hans (for the concept) and my wife, Jen (for the execution)!

[Edited at 2015-03-06 23:25 GMT]


 
Post removed: This post was hidden by a moderator or staff member because it was not in line with site rule
jaeger-berlin
jaeger-berlin
Germany
Local time: 22:15
English to German
Looking for somebody who could re-upload the files Dec 21, 2017

Dear ProZ-members,

Is there anybody who could upload the GermanEnglish tmx files again?
This would make a great Christmas present for me.

I'm looking forward to this parallel corpus.
Thanks in advance for all the effort you put in this, especially to Michael Joseph.

Season's greetings


 
Michael Beijer
Michael Beijer  Identity Verified
United Kingdom
Local time: 21:15
Member (2009)
Dutch to English
+ ...
Merry Christmas! Dec 25, 2017

jaeger-berlin wrote:

Dear ProZ-members,

Is there anybody who could upload the GermanEnglish tmx files again?
This would make a great Christmas present for me.

I'm looking forward to this parallel corpus.
Thanks in advance for all the effort you put in this, especially to Michael Joseph.

Season's greetings


I’m in Barcelona for Christmas, but will upload them when I get back to the UK!

Michael


 
VALERIJ SERAFIN
VALERIJ SERAFIN
Lithuania
German to Russian
+ ...
Yes, please! Jan 24, 2018

Please re-upload kind sir,

Thank you in advance.


 
Noe Tessmann
Noe Tessmann  Identity Verified
Local time: 22:15
English to German
+ ...
TOPIC STARTER
TMX too big? Jan 25, 2018

Dear Michael,

when I try to import your patent corpus into MemoQ it always fails. It seems that the corpus is too big for the CAT Tool. I can only use it in TMLookup.
Are you able to use it in a standard CAT tool?

KR

Noe


 
Michael Beijer
Michael Beijer  Identity Verified
United Kingdom
Local time: 21:15
Member (2009)
Dutch to English
+ ...
Hi everyone, here they are! Jan 25, 2018

Sorry for the delay, am back at work again.

Here are all the files: http://beijer.uk/storage2/PatTR/

Capture

About not being able to import them into memoQ, I wouldn't really know, as I haven't tried it myself. I also don't really use memoQ anymore, just CafeTran. But that's another story!

I have found that the only way to really use TMX collections of the size is either in TMLookup, or in LogiTerm (which is how I am managing everything currently).

Michael

PS: "en-fr" is currently still being uploaded, and should be ready in 30 mins or so!

[Edited at 2018-01-25 13:39 GMT]


 
Yolande Hivart
Yolande Hivart
Austria
Local time: 22:15
Member (2016)
German to French
Thank you very much Jan 25, 2018

Thank you very much for the DE-FR, I did not expect it.

Yolande


 
Michael Beijer
Michael Beijer  Identity Verified
United Kingdom
Local time: 21:15
Member (2009)
Dutch to English
+ ...
happy it's useful to someone! Jan 25, 2018

Haneder wrote:

Thank you very much for the DE-FR, I did not expect it.

Yolande




 
Rolf Keller
Rolf Keller
Germany
Local time: 22:15
English to German
Gimme more RAM Jan 26, 2018

Michael Joseph Wdowiak Beijer wrote:

I have found that the only way to really use TMX collections of the size is either in TMLookup, or in LogiTerm

FWIW, I downloaded the 9 CLAIMS files for en-de, unzipped them, set them as 9 data sources in Omni-Lookup and defined a single group of them. So I can search them simultaneously. It works but I need my 32 GB RAM computer, 16 GB wouldn't suffice. Probably, 16 GB were enough if you combined the 9 files into one, but I haven't tested this possibility so far.

The EN-FR files are significant smaller. I learnt a wrong language.


 
Michael Beijer
Michael Beijer  Identity Verified
United Kingdom
Local time: 21:15
Member (2009)
Dutch to English
+ ...
yeah, my work laptop has 32 GB too Jan 26, 2018

Rolf Keller wrote:

Michael Joseph Wdowiak Beijer wrote:

I have found that the only way to really use TMX collections of the size is either in TMLookup, or in LogiTerm

FWIW, I downloaded the 9 CLAIMS files for en-de, unzipped them, set them as 9 data sources in Omni-Lookup and defined a single group of them. So I can search them simultaneously. It works but I need my 32 GB RAM computer, 16 GB wouldn't suffice. Probably, 16 GB were enough if you combined the 9 files into one, but I haven't tested this possibility so far.

The EN-FR files are significant smaller. I learnt a wrong language.


...which is really needed these days when dealing with Big Data.

Michael


 
VALERIJ SERAFIN
VALERIJ SERAFIN
Lithuania
German to Russian
+ ...
Wow! Jan 26, 2018

Thank you for the swift response.

This is one of the best seasonal presents ever!


 
FarkasAndras
FarkasAndras  Identity Verified
Local time: 22:15
English to Hungarian
+ ...
Wrong approach Jan 27, 2018

Rolf Keller wrote:

Michael Joseph Wdowiak Beijer wrote:

I have found that the only way to really use TMX collections of the size is either in TMLookup, or in LogiTerm

FWIW, I downloaded the 9 CLAIMS files for en-de, unzipped them, set them as 9 data sources in Omni-Lookup and defined a single group of them. So I can search them simultaneously. It works but I need my 32 GB RAM computer, 16 GB wouldn't suffice. Probably, 16 GB were enough if you combined the 9 files into one, but I haven't tested this possibility so far.

The EN-FR files are significant smaller. I learnt a wrong language.

Even if you can get it to work, it will take a while to read 10-20GB into memory. IMO it would be better to use a search tool that doesn't load the whole TM into memory.


 
Daniel James Holt (X)
Daniel James Holt (X)
Germany
German to English
+ ...
Still active? Jan 6, 2021

Dear OP,

I just stumbled across this thread whilst doing some research on Cafetran and am curious to know whether your extensive database is still available.

Kind regards,
Daniel Holt


 
Michael Beijer
Michael Beijer  Identity Verified
United Kingdom
Local time: 21:15
Member (2009)
Dutch to English
+ ...
yup! Jan 6, 2021

Daniel James Holt wrote:

Dear OP,

I just stumbled across this thread whilst doing some research on Cafetran and am curious to know whether your extensive database is still available.

Kind regards,
Daniel Holt


Hi Daniel,

Yes it is. Just uploading it to my new website. Will post details here soon...

Michael

####

OK, ready, here you go:

https://beijerpedia.com/wiki/PatTR:_Patent_Translation_Resource_alignments_(aligned_by_Michael_Beijer,_in_2015)

Michael

[Edited at 2021-01-06 18:13 GMT]


 
Pages in topic:   < [1 2 3 4 5] >


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

tmx from Parallel corpus of Patent Translation Resource?






TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »
Trados Business Manager Lite
Create customer quotes and invoices from within Trados Studio

Trados Business Manager Lite helps to simplify and speed up some of the daily tasks, such as invoicing and reporting, associated with running your freelance translation business.

More info »