Adds correct UTF-8 encoding to Net::BER::BerIdentifiedString by andibachmann · Pull Request #242 · ruby-ldap/ruby-net-ldap

andibachmann · 2015-12-14T13:51:56Z

Dear Net-LDAP maintainers

Currently, net-ldap returns values containing umlauts (e.g. 'Müller') with an encoding as 'ASCII-8BIT'.
This is wrong. LDAP in version 3 should encode all data in 'UTF-8' and therefore when
casting returned ''string'' data into a Net::BER::BerIdentifiedString the encoding should be 'UTF-8'.

The current code tries to '#encode('UTF-8')' which results in an
Encoding::UndefinedConversionError: "\xC3" from ASCII-8BIT to UTF-8. This error is trapped
and the binary string is returned instead.

If we assume that the data coming from our LDAP/AD-Server is correctly encoded in 'UTF-8', there is
no need to use #encode. The only thing is to set correctly the encoding of the (string) data, i.e. to use force_encoding.

Example:

bin_str = "Müller".b
p [:bin_str, bin_str.encoding, bin_str.bytes, bin_str]
# => [:bin_str, #<Encoding:ASCII-8BIT>, [77, 195, 188, 108, 108, 101, 114], "M\xC3\xBCller"]

# Now try to 'encode' it:
bin_str.encode('UTF-8')
# =>Encoding::UndefinedConversionError: "\xC3" from ASCII-8BIT to UTF-8

# Better:
bin_str.force_encoding('UTF-8')
p [:bin_str, bin_str.encoding, bin_str.bytes, bin_str]
# => [:bin_str, #<Encoding:UTF-8>, [77, 195, 188, 108, 108, 101, 114], "Müller"]

I must admit that I have only checked the code (and added one test case) for Umlauts (and characters more or less covered by ISO-8859-1). I have no idea how to test against
Korean, Japanese, Russian, or Chinese encodings.

Nevertheless, I am pretty sure that the current code is bogus for any 'non-ASCII' characters.

Please let me know, if you need further details and I'd be happy to help you out.

regards
andi

jch · 2015-12-14T21:50:28Z

test/ber/test_ber.rb

Thanks for fixing this.

jch · 2015-12-14T21:55:10Z

@andibachmann thank you for taking the time to open this pull request and including helpful comments. I think this should be fine and compatible with the past changes in #212 because we're changing the internal string object. Is there anything else you would like me to review specifically before I merge?

andibachmann · 2015-12-15T07:28:57Z

@jch, besides special Encodings like Japanese, Chinese and Korean, I'm pretty sure the code is OK.
thanks for the review!

satoryu · 2015-12-15T07:41:21Z

besides special Encodings like Japanese, Chinese and Korean, I'm pretty sure the code is OK.

Yes, I have a monkey patch as well as your changes does :)

jch · 2015-12-15T15:53:09Z

Thanks all for chiming in. Merging.

Adds correct UTF-8 encoding to Net::BER::BerIdentifiedString

=== Net::LDAP 0.13.0 * Set a connect_timeout for the creation of a socket {#243}[ruby-ldap/ruby-net-ldap#243] * Update bundler before installing gems with bundler {#245}[ruby-ldap/ruby-net-ldap#245] * Net::LDAP#encryption accepts string {#239}[ruby-ldap/ruby-net-ldap#239] * Adds correct UTF-8 encoding to Net::BER::BerIdentifiedString {#242}[ruby-ldap/ruby-net-ldap#242] * Remove 2.3.0-preview since ruby-head already is included {#241}[ruby-ldap/ruby-net-ldap#241] * Drop support for ruby 1.9.3 {#240}[ruby-ldap/ruby-net-ldap#240] * Fixed capitalization of StartTLSError {#234}[ruby-ldap/ruby-net-ldap#234]

Version 0.14.0 of the gem (actually, starting with 0.13.0) contains a code change that fixes an encoding error (Encoding::UndefinedConversionError) that happens when there are extended characters in a dn. The fix forces utf-8 encoding instead of ASCII-8BIT for objects returned from the directory. See ruby-ldap/ruby-net-ldap#242 https://bugzilla.redhat.com/show_bug.cgi?id=1367600

. adds correct UTF-8 encoding

ecce488

jch reviewed Dec 14, 2015
View reviewed changes

test/ber/test_ber.rb

Copy link

Member

jch Dec 14, 2015

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for fixing this.

jch added a commit that referenced this pull request Dec 15, 2015

Merge pull request #242 from andibachmann/master

ea21ef9

Adds correct UTF-8 encoding to Net::BER::BerIdentifiedString

jch merged commit ea21ef9 into ruby-ldap:master Dec 15, 2015

astratto pushed a commit to astratto/ruby-net-ldap that referenced this pull request Dec 18, 2015

Merge pull request ruby-ldap#242 from andibachmann/master

a970d43

Adds correct UTF-8 encoding to Net::BER::BerIdentifiedString

tomilaine mentioned this pull request Feb 2, 2016

Fix for user attributes not encoded properly. tomilaine/omniauth-ldap#1

Merged

gtanzillo mentioned this pull request Aug 19, 2016

Updating to newer version of net-ldap gem ManageIQ/manageiq#10635

Merged

dependabot-preview bot mentioned this pull request Nov 1, 2018

Update net-ldap requirement from = 0.11 to = 0.16.1 sensu-plugins/sensu-plugins-openldap#9

Closed

dependabot-preview bot mentioned this pull request Jan 25, 2019

Update net-ldap requirement from ~> 0.3.1 to ~> 0.16.1 malachaifrazier/redmine#7

Open

dependabot-preview bot mentioned this pull request Jun 6, 2019

[Security] Bump net-ldap from 0.3.1 to 0.16.1 CUGC/nyyms#25

Closed

dependabot bot mentioned this pull request Jul 2, 2019

Bump net-ldap from 0.11 to 0.16.0 in /script/ldap_test/2015-02-09 fiedl/your_platform#34

Merged

dependabot bot mentioned this pull request Dec 18, 2023

Bump the bundler group across 1 directories with 5 updates GrantBirki/ldap-api#1

Closed

dependabot bot mentioned this pull request May 14, 2024

Bump the bundler group across 1 directory with 18 updates chrismoultonsf/cartodb#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds correct UTF-8 encoding to Net::BER::BerIdentifiedString#242

Adds correct UTF-8 encoding to Net::BER::BerIdentifiedString#242
jch merged 1 commit intoruby-ldap:masterfrom
andibachmann:master

andibachmann commented Dec 14, 2015

Uh oh!

jch Dec 14, 2015

Uh oh!

jch commented Dec 14, 2015

Uh oh!

andibachmann commented Dec 15, 2015

Uh oh!

satoryu commented Dec 15, 2015

Uh oh!

jch commented Dec 15, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

andibachmann commented Dec 14, 2015

Uh oh!

jch Dec 14, 2015

Choose a reason for hiding this comment

Uh oh!

jch commented Dec 14, 2015

Uh oh!

andibachmann commented Dec 15, 2015

Uh oh!

satoryu commented Dec 15, 2015

Uh oh!

jch commented Dec 15, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants