Re^2: Japanese character in Linux

Replies are listed 'Best First'.
Re^3: Japanese character in Linux by Corion (Patriarch) on Jul 07, 2011 at 14:20 UTC
You need to check five things: In what encoding is the data stored in the Sybase database? Does your script `Encode::decode` the data from the proper encoding? In what encoding is the data stored in the Oracle database? Does your script `Encode::encode` the data to the proper encoding? Does your script output to the console in the encoding that the console uses?	[reply]
Re^4: Japanese character in Linux by prafulltc (Acolyte) on Jul 08, 2011 at 06:04 UTC
In Sybase Japanese data columns are encoded in Shift-JIS encoding. We are retrieving this data using DBI. use DBI qw(:sql_types); if ( @row = $dbFOX_sth->fetchrow_array ) { ( $sInstrumentNameJ, $sInstrumentShortJ ) = @row; When we print this data in unix console it comes as junk. After we get this value in a variable we pass this to a stored proc which inserts data in Oracle Nvarchar2 data type field. Here it comes as inverted ?. Please advise.	[reply]
Re^5: Japanese character in Linux by andal (Hermit) on Jul 08, 2011 at 07:43 UTC
I guess, we'll have to go step by step. First, add "use Encode;". After you've obtained the values from DB, check if they are converted to internal perl encoding using `print Encode::is_utf8($sInstrumentNameJ), "\n";` [download] If this produces "1", then the value is converted to perl's internal form and we should check how you output it to the terminal. If this produces empty string, then the value is not converted by the driver. In this case you have to convert it manually. In either case, we have to know which locale is active in your terminal emulator. Normally, it shall be some UTF-8 locale, but who knows. Please provide output of "locale" command. Also, if the "is_utf8" function produces empty string, it would be good to provide here the hexdump of the value you get from the database. Using this way for example `print unpack("H*", $sInstrumentNameJ), "\n";` [download] And also the Japanese text it should correspond to.	[reply] [d/l] [select]
Re^6: Japanese character in Linux by prafulltc (Acolyte) on Jul 08, 2011 at 09:33 UTC
Re^7: Japanese character in Linux by andal (Hermit) on Jul 08, 2011 at 09:56 UTC
Some notes below your chosen depth have not been shown here
Re^5: Japanese character in Linux by Corion (Patriarch) on Jul 08, 2011 at 06:42 UTC
Please see points 2 to 5 of my reply.	[reply]
Re^3: Japanese character in Linux by mpeppler (Vicar) on Jul 19, 2011 at 09:27 UTC
What is the charset of the Sybase server? Are the strings stored in univarchar() or in varchar() columns? If the charset is not set to utf8, and if the columns are not univarchar(), then you need to either have the client code (DBI/DBD::Sybase) use the utf8 charset when fetching the data, or use the same charset as the dataserver. If the data is stored in univarchar() columns then you should set your client charset to utf8. DBD::Sybase 1.12 should have better support for unicode/utf8 characters, btw. Michael	[reply]


The stupid question is the question not asked
	PerlMonks