On-Line Appendix: STRAND Evaluation

Appendix: STRAND Experimental Data

This Web page is an on-line Appendix to

Philip Resnik, "Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text," to appear at AMTA-98, the 1998 Conference of the Association for Machine Translation in the Americas, October 28-31, 1998.

Please see the paper for details.

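This Appendix contains:
  1. Candidate pairs STRAND identified as translations
  2. Candidate pairs STRAND identified as not being translations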

Note that in a few cases, particularly news-reporting pages that are updated daily, the content at these URLs has changed since the evaluation was done on 10 June 1998.


Candidate pairs STRAND identified as translations

Pairs that are false positives (in the author's judgment) are starred: an asterisk (*) precedes the first URL of the pair.
  1. http://204.71.88.85/Sana/english.htm http://204.71.88.85/Sana/spanish.htm
  2. http://info.utas.edu.au/docs/flonta/DPbooks/Bibles/Englishproverbs/proeng1.html http://info.utas.edu.au/docs/flonta/DPbooks/Bibles/Spanishproverbs/prospan1.html
  3. http://interforever-sports.com/test/osmain-EN.htm http://interforever-sports.com/test/osmain-SP.htm
  4. http://www.academyofenglish.com/english/index.htm http://www.academyofenglish.com/spanish/index.htm
  5. http://www.amazings.com/ingles.html http://www.amazings.com/espanyol.html
  6. http://www.caim.com/english.htm http://www.caim.com/espanol.htm
  7. http://www.ddex.com/index-e.html http://ddex.com/index-s.html
  8. http://www.dickow.com/parts.htm http://www.dickow.com/spanish.htm
  9. http://www.fuessen.com/fuessener-land/neuschwa.htm http://www.fuessen.com/fuessener-land/neuschws.htm
  10. http://www.galinor.es/hcoruna/univ2.html http://www.galinor.es/hcoruna/universal.html
  11. http://www.imf.org/external/np/ins/french/info/../../english/info/vienna.htm http://www.imf.org/external/np/ins/french/info/../../spanish/info/vienna.htm
  12. *http://www.international-house-london.ac.uk/general/index.html http://www.international-house-london.ac.uk/modlang/index.html
  13. http://www.legaldatasearch.com/index.html http://www.legaldatasearch.com/indexsp.html
  14. http://www.pyronics.com/prdtmn_eng.htm http://www.pyronics.com/prdtmn_span.htm
  15. http://www.solofutbol.com/ifaboutus-EN.htm http://www.solofutbol.com/ifaboutus-SP.htm
  16. *http://www.syrlang.com/D_Store/D_Direct/g_english.htm http://www.syrlang.com/D_Store/D_Direct/g_spanish.htm
  17. http://www.union-city.k12.nj.us/community/uez1/uezhome.html http://www.union-city.k12.nj.us/community/uez1/uezhome2.html

Candidate pairs STRAND identified as not being translations

Pairs that are false negatives (in the author's judgment) are starred: an asterisk (*) precedes the first URL of the pair.
  1. http://grub01.physto.se:8080/cgi-bin/english2spani.pl http://www.gmsmuc.de/english/look.html
  2. http://maxthink.com/esi/esi.31/7031 http://maxthink.com/esi/esi.32/2132
  3. http://maxthink.com/esi/esi.31/7031 http://maxthink.com/esi/esi.75/2275
  4. http://maxthink.com/esi/esi.58/6458 http://maxthink.com/esi/esi.59/6459
  5. http://maxthink.com/esi/esi.58/6458 http://maxthink.com/esi/esi.60/6460
  6. http://maxthink.com/esi/esi.60/6460 http://maxthink.com/esi/esi.59/6459
  7. http://telescan.nki.nl//index.html http://telescan.nki.nl//spanish/index.html
  8. *http://venus.rdc.puc-rio.br/kids/planmden.htm http://venus.rdc.puc-rio.br/kids/planmdsp.htm
  9. *http://www-fr.cisco.com/warp/public/734/cea/ceanf_ds.htm http://www-fr.cisco.com/warp/public/734/cea/_sp_ceanf_ds.htm
  10. *http://www.bodigard.com/english.html http://www.bodigard.com/spanish.html
  11. http://www.ces-msa.org/doas/COLEGIO/ENGLISH.HTM http://www.ces-msa.org/doas/COLEGIO/SPANISH.HTM
  12. *http://www.cisco.com/warp/public/729/c5000/grpsw_ds.htm http://www.cisco.com/warp/public/729/c5000/_sp_grpsw_ds.htm
  13. http://www.coc.cc.ca.us/offices/inst_dev/emp/section6/english/english.html http://www.coc.cc.ca.us/offices/inst_dev/emp/section6/spanish/spanish.html
  14. http://www.coc.cc.ca.us/offices/inst_dev/emp/section6/esl/esl.html http://www.coc.cc.ca.us/offices/inst_dev/emp/section6/spanish/spanish.html
  15. http://www.cybertelecom.com.br/luciana/pausini.htm http://www.cybertelecom.com.br/luciana/shakira3.htm
  16. *http://www.dekalbrealtors.com/enpage.html http://www.dekalbrealtors.com/sppage.html
  17. http://www.dickow.com/parts.htm http://www.dickow.com/spanishq.htm
  18. http://www.dickow.com/quote.htm http://www.dickow.com/spanish.htm
  19. *http://www.dickow.com/quote.htm http://www.dickow.com/spanishq.htm
  20. http://www.dresser.com/instruments/products/Duratran2279/2279Duratran.pdf http://www.dresser.com/instruments/products/Duratran2279/2279_Duratran_Span.pdf
  21. http://www.edgarcayce.com/cgi-bin/edgarcayce/englishinprint.html http://www.edgarcayce.com/cgi-bin/edgarcayce/spanish.html
  22. http://www.gmsmuc.de/english/look.html http://grub01.physto.se:8080/cgi-bin/english2spani.pl
  23. http://www.gospelcom.net/rbc/home/devo.shtml http://www.gospelcom.net/bttb/rcspanish.html
  24. http://www.gospelcom.net/rbc/home/devo.shtml http://www.netusa1.net/~jandras/spanish.html
  25. http://www.gospelcom.net/rbc/home/devo.shtml http://www.peniel-argentina.org
  26. http://www.gospelcom.net/rbc/home/devo.shtml http://www.thechristian.org/spanish/index.html
  27. http://www.gospelcom.net/rbc/home/devo.shtml http://www.wp.com/GILDAORLANDI/spanishm.htm
  28. http://www.grad.mu.edu//engl/engl.html http://www.grad.mu.edu//foll/foll.html
  29. http://www.greatlakessoftware.com/software/cims/lit_enu.pdf http://www.greatlakessoftware.com/software/cims/lit_esn.pdf
  30. http://www.gy.com/www/ww1/ww2/excel03.htm http://www.gy.com/www/ww1/ww2/char68.htm
  31. http://www.gy.com/www/ww1/ww2/excel15.htm http://www.gy.com/www/ww1/ww2/char68.htm
  32. http://www.gy.com/www/ww1/ww2/lear01.htm http://www.gy.com/www/ww1/ww2/char68.htm
  33. http://www.ictsd.org/text_only_html/../English/English.pdf http://www.ictsd.org/text_only_html/../Spanish/Spanish.pdf
  34. http://www.image.dk/~knud-sor/en/inter.html http://www.gospelcom.net/bttb/rcspanish.html
  35. http://www.image.dk/~knud-sor/en/inter.html http://www.gospelcom.net/rbc/home/devo.shtml
  36. http://www.image.dk/~knud-sor/en/inter.html http://www.netusa1.net/~jandras/spanish.html
  37. http://www.image.dk/~knud-sor/en/inter.html http://www.peniel-argentina.org
  38. http://www.image.dk/~knud-sor/en/inter.html http://www.thechristian.org/spanish/index.html
  39. http://www.image.dk/~knud-sor/en/inter.html http://www.wp.com/GILDAORLANDI/spanishm.htm
  40. http://www.imf.org/external/np/ins/french/info/../../english/index.htm http://www.imf.org/external/np/ins/french/info/../../spanish/info/vienna.htm
  41. http://www.imf.org/external/np/ins/spanish/training/../../english/index.htm http://www.imf.org/external/np/ins/spanish/training/../../spanish/training/vcourses.htm
  42. http://www.imf.org/external/np/ins/spanish/training/../../english/training/vcourses.htm http://www.imf.org/external/np/ins/spanish/training/../../spanish/training/vcourses.htm
  43. http://www.international-house-london.ac.uk/excentre/index.html http://www.international-house-london.ac.uk/modlang/index.html
  44. http://www.international-house-london.ac.uk/teacher/index.html http://www.international-house-london.ac.uk/modlang/index.html
  45. http://www.islander-magazine.com/englishgarden.html http://www.islander-magazine.com/spanish.html
  46. http://www.june29.com/IDP/IDPsearch.html http://www.travlang.com/languages/cgi-bin/langchoice.cgi?page=main&lang1=english&lang2=spanish
  47. http://www.nlnnet.com/nlnnet/unique.html http://www.gospelcom.net/bttb/rcspanish.html
  48. http://www.nlnnet.com/nlnnet/unique.html http://www.gospelcom.net/rbc/home/devo.shtml
  49. http://www.nlnnet.com/nlnnet/unique.html http://www.netusa1.net/~jandras/spanish.html
  50. http://www.nlnnet.com/nlnnet/unique.html http://www.peniel-argentina.org
  51. http://www.nlnnet.com/nlnnet/unique.html http://www.thechristian.org/spanish/index.html
  52. http://www.nlnnet.com/nlnnet/unique.html http://www.wp.com/GILDAORLANDI/spanishm.htm
  53. http://www.optichrome.com/english/distrib.html http://www.optichrome.com/spanish/index.html
  54. http://www.optichrome.com/english/index.html http://www.optichrome.com/spanish/index.html
  55. http://www.optichrome.com/english/y2000.html http://www.optichrome.com/spanish/index.html
  56. *http://www.orst.edu//hometext.htm http://www.orst.edu//sphomepe.htm
  57. http://www.peniel-argentina.org http://www.gospelcom.net/bttb/rcspanish.html
  58. http://www.peniel-argentina.org http://www.gospelcom.net/rbc/home/devo.shtml
  59. http://www.peniel-argentina.org http://www.netusa1.net/~jandras/spanish.html
  60. http://www.peniel-argentina.org http://www.thechristian.org/spanish/index.html
  61. http://www.peniel-argentina.org http://www.wp.com/GILDAORLANDI/spanishm.htm
  62. http://www.polio.com/CHK_ENG.DOC http://www.polio.com/CHK_SPN.DOC
  63. http://www.polio.com/chk_eng.htm http://www.polio.com/CHK_SPN.DOC
  64. http://www.prague.org/english.html http://www.prague.org//espanol/
  65. http://www.prague.org/schedule1.html http://www.prague.org//espanol/
  66. http://www.presidencia.gob.mx/welcome/index.htm http://www.presidencia.gob.mx
  67. http://www.rcinet.ca/en/actual/news/news.htm http://www.rcinet.ca/es/actual/news/news.htm
  68. http://www.shure.com/pdf/specsheets/spec_wireless/sc.pdf http://www.shure.com/pdf/userguides/guides_wireless/sc_es.pdf
  69. http://www.shure.com/pdf/userguides/guides_wireless/sc_en.pdf http://www.shure.com/pdf/userguides/guides_wireless/sc_es.pdf
  70. http://www.travlang.com/languages/cgi-bin/langchoice.cgi?page=main&lang1=english&lang2=spanish http://www.june29.com/IDP/IDPsearch.html
  71. *http://www.umbi.umd.edu/~edgar/fcd_eng.html http://www.umbi.umd.edu/~edgar/fcd_esp.html
  72. *http://www.westcoastdataservices.com/wcds/page27.html http://www.westcoastdataservices.com/wcds/page44.html
  73. http://www.yahoo.co.uk/Reference/Dictionaries/Language/ http://grub01.physto.se:8080/cgi-bin/ssis/~calcato/espanol.html
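
The starring convention used in the two lists above makes it possible to tally the evaluation directly from this page. The following is a minimal Python sketch of such a tally, not part of the original evaluation. The names ENTRY, parse_pairs and precision_recall are hypothetical, and the sketch assumes every entry has the form "N. [*]URL1 URL2" with no spaces inside a URL, as in the lists above.

```python
import re

# Assumed entry format: "N. [*]URL1 URL2" (the star, if any, precedes URL1).
ENTRY = re.compile(r"^\s*(\d+)\.\s+(\*?)(\S+)\s+(\S+)\s*$")


def parse_pairs(lines):
    """Return (url1, url2, starred) tuples for lines that match ENTRY."""
    pairs = []
    for line in lines:
        m = ENTRY.match(line)
        if m:
            _, star, url1, url2 = m.groups()
            pairs.append((url1, url2, star == "*"))
    return pairs


def precision_recall(accepted, rejected):
    """Compute precision and recall from the two lists.

    accepted: pairs the system labeled as translations (star = false positive).
    rejected: pairs the system labeled as non-translations (star = false negative).
    """
    fp = sum(1 for _, _, starred in accepted if starred)
    tp = len(accepted) - fp
    fn = sum(1 for _, _, starred in rejected if starred)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Two entries from each list above, used only as a small illustration.
accepted = parse_pairs([
    "  6. http://www.caim.com/english.htm http://www.caim.com/espanol.htm",
    "  12. *http://www.international-house-london.ac.uk/general/index.html "
    "http://www.international-house-london.ac.uk/modlang/index.html",
])
rejected = parse_pairs([
    "  10. *http://www.bodigard.com/english.html http://www.bodigard.com/spanish.html",
    "  64. http://www.prague.org/english.html http://www.prague.org//espanol/",
])

precision, recall = precision_recall(accepted, rejected)
print(precision, recall)  # prints "0.5 0.5" for this four-entry sample
```

Applied to the full lists in this Appendix (17 accepted pairs, 2 of them starred; 73 rejected pairs, 9 of them starred), the same tally gives a precision of 15/17 and a recall of 15/24.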