Opened 14 years ago

Closed 13 years ago

Last modified 13 years ago

#953 closed Enhancement (invalid)

support optional encodings (for torrent names and strings)

Reported by: Pragmatic Owned by:
Priority: Normal Milestone: None Set
Component: Transmission Version: 1.20
Severity: Normal Keywords: encoding
Cc:

Description

some of the torrents I am getting don't use utf-8. they use variants of Traditional and Simplified Chinese. unfortunately, when Transmission adds these torrents to its queue, the strings are displayed as latin garbage. it's possible that the encoding has not been specified within the torrent.

please add an enhancement where if the encoding is not utf-8 or a variant of English, and the encoding is not specified or several different encodings could apply, that the user be allowed to choose which encoding to use. this could possibly be done via an encoding popup menu (with a sample of the consequent torrent name) in the Add Torrent window.

I have attached a sample torrent.

cheers.

Attachments (2)

[HDzone]大闹广昌隆.torrent (51.9 KB) - added by Pragmatic 14 years ago.
a Simplified Chinese torrent file which does not display correctly in Transmission
´óÄֹã²ý¡.52adefb5d3fa2608.torrent (51.9 KB) - added by Pragmatic 14 years ago.
a Simplified Chinese torrent which does not specify its encoding

Download all attachments as: .zip

Change History (8)

Changed 14 years ago by Pragmatic

a Simplified Chinese torrent file which does not display correctly in Transmission

comment:1 follow-up: Changed 14 years ago by Pragmatic

within the attached torrent is a segment which reads: "ecopy13:comment.utf-810". is utf-810 the encoding?

comment:2 in reply to: ↑ 1 Changed 14 years ago by mezz

Replying to Pragmatic:

within the attached torrent is a segment which reads: "ecopy13:comment.utf-810". is utf-810 the encoding?

The BitTorrent? client will reading it as UTF-8 and ignore '10'. The '10' is like a code for seperate. I don't know full BitTorrent? code, but if you check inside torrent more and you will see there are several of numbers ('10', '6' and etc) as for seperate. I am not sure if 'seperate' is a right word for that, so I will let someone to answer it more correct if need.

Changed 14 years ago by Pragmatic

a Simplified Chinese torrent which does not specify its encoding

comment:3 Changed 14 years ago by Pragmatic

in this torrent, the encoding in many places (including file names) is not specified. only by experimentation and subjective review can it be established that the encoding is Simplified Chinese EUC.

comment:4 Changed 14 years ago by Pragmatic

my apologies. please delete the second attachment. it's identical to the first.

comment:5 Changed 13 years ago by charles

  • Resolution set to invalid
  • Status changed from new to closed

a .torrent file's strings are required by the BitTorrent? specification to use the UTF-8 encoding. If these torrents aren't doing so, then they are broken.

Note: See TracTickets for help on using tickets.