Internet DRAFT - draft-jiang-dhc-dhcp-privacy

draft-jiang-dhc-dhcp-privacy







dhc                                                             S. Jiang
Internet-Draft                              Huawei Technologies Co., Ltd
Intended status: Standards Track                             S. Krishnan
Expires: April 30, 2015                                         Ericsson
                                                            T. Mrugalski
                                                                     ISC
                                                        October 27, 2014


                    Privacy considerations for DHCP
                    draft-jiang-dhc-dhcp-privacy-00

Abstract

   DHCP is a protocol that is used to provide addressing and
   configuration information to IPv4 hosts.  This document discusses the
   various identifiers used by DHCP and the potential privacy issues.

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at http://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on April 30, 2015.

Copyright Notice

   Copyright (c) 2014 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of




Jiang, et al.            Expires April 30, 2015                 [Page 1]

Internet-Draft         DHCP Privacy considerations          October 2014


   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
   2.  Terminology . . . . . . . . . . . . . . . . . . . . . . . . .   3
   3.  Identifiers in DHCP . . . . . . . . . . . . . . . . . . . . .   3
     3.1.  Client ID Option  . . . . . . . . . . . . . . . . . . . .   3
     3.2.  Address Fields & Options  . . . . . . . . . . . . . . . .   4
     3.3.  Subscriber-ID Option  . . . . . . . . . . . . . . . . . .   4
     3.4.  Relay Agent Information Option and Sub-options  . . . . .   4
     3.5.  Client FQDN Option  . . . . . . . . . . . . . . . . . . .   5
     3.6.  Parameter Request List Option . . . . . . . . . . . . . .   5
     3.7.  Vendor Class and Vendor-Identifying Vendor Class Options    5
     3.8.  Civic Location Option . . . . . . . . . . . . . . . . . .   6
     3.9.  Coordinate-Based Location Option  . . . . . . . . . . . .   6
     3.10. Client System Architecture Type Option  . . . . . . . . .   6
   4.  Existing Mechanisms That Affect Privacy . . . . . . . . . . .   6
     4.1.  DNS Updates . . . . . . . . . . . . . . . . . . . . . . .   6
     4.2.  Allocation strategies . . . . . . . . . . . . . . . . . .   7
   5.  Attacks . . . . . . . . . . . . . . . . . . . . . . . . . . .   8
     5.1.  Device type discovery . . . . . . . . . . . . . . . . . .   8
     5.2.  Operating system discovery  . . . . . . . . . . . . . . .   8
     5.3.  Finding location information  . . . . . . . . . . . . . .   8
     5.4.  Finding previously visited networks . . . . . . . . . . .   9
     5.5.  Finding a stable identity . . . . . . . . . . . . . . . .   9
     5.6.  Pervasive monitoring  . . . . . . . . . . . . . . . . . .   9
     5.7.  Finding client's IP address or hostname . . . . . . . . .   9
     5.8.  Correlation of activities over time . . . . . . . . . . .   9
     5.9.  Location tracking . . . . . . . . . . . . . . . . . . . .   9
     5.10. Leasequery & bulk leasequery  . . . . . . . . . . . . . .  10
   6.  Security Considerations . . . . . . . . . . . . . . . . . . .  10
   7.  Privacy Considerations  . . . . . . . . . . . . . . . . . . .  10
   8.  IANA Considerations . . . . . . . . . . . . . . . . . . . . .  10
   9.  Acknowledgements  . . . . . . . . . . . . . . . . . . . . . .  10
   10. References  . . . . . . . . . . . . . . . . . . . . . . . . .  10
     10.1.  Normative References . . . . . . . . . . . . . . . . . .  11
     10.2.  Informative References . . . . . . . . . . . . . . . . .  12
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  12

1.  Introduction

   Dynamic Host Configuration Protocol (DHCP) [RFC2131] is a protocol
   that is used to provide addressing and configuration information to
   IPv4 hosts.  The DHCP protocol uses several identifiers that could
   become a source for gleaning additional information about the IPv4
   host.  This information may include device type, operating system



Jiang, et al.            Expires April 30, 2015                 [Page 2]

Internet-Draft         DHCP Privacy considerations          October 2014


   information, location(s) that the device may have previously visited,
   etc.  This document discusses the various identifiers used by DHCP
   and the potential privacy issues [RFC6973].

   Future works may propose protocol changes to fix the privacy issues
   that have been analyzed in this document.  It is out of scope for
   this document.

   Editor notes: for now, the document is mainly considering the privacy
   of DHCP client.  The privacy of DHCP server and relay agent are
   considered less important because they are open for public services.
   However, this may be a subject to change if further study shows
   opposite result.

2.  Terminology

   This section clarifies the terminology used throughout this document.

   Stable identifier - any property disclosed by a DHCP client that does
   not change over time or changes very infrequently and is unique for
   said client in a given context.  Examples include MAC address,
   client-id that does not change or a hostname.  Stable identifier may
   or may not be globally unique.

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in [RFC2119].  When these
   words are not in ALL CAPS (such as "should" or "Should"), they have
   their usual English meanings, and are not to be interpreted as
   [RFC2119] key words.

3.  Identifiers in DHCP

   There are several identifiers used in DHCP.  This section provides an
   introduction to the various options that will be used further in the
   document.

3.1.  Client ID Option

   The Client Identifier Option [RFC2131] is used to pass an explicit
   client identifier to a DHCP server.  There is an analogous Server
   Identifier Option but it is not as interesting in the privacy context
   (unless a host can be convinced to start acting as a server).

   The client identifier is an opaque key, which must be unique to that
   client within the subnet to which the client is attached.  It
   typically remains stable after it has been initially generated.  It
   may contain a hardware address, identical to the contents of the



Jiang, et al.            Expires April 30, 2015                 [Page 3]

Internet-Draft         DHCP Privacy considerations          October 2014


   'chaddr' field, or another type of identifier, such as a DNS name.
   It is recommended that client identifiers be generated by using the
   permanent link-layer address of the network interface that the client
   is trying to configure.  [RFC4361] updates the recommendation of
   Client Identifiers to be "consists of a type field whose value is
   normally 255, followed by a four-byte IA_ID field, followed by the
   DUID for the client as defined in RFC 3315, section 9".  This does
   not change the lifecycle of the Client Identifiers.  Clients are
   expected to generate their Client Identifiers once (during first
   operation) and store it in a non-volatile storage or use the same
   deterministic algorithm to generate the same Client Identifier values
   again.

3.2.  Address Fields & Options

   The 'yiaddr' field [RFC2131] in DHCP message is used to allocate
   address from the server to the client.

   The DHCPv4 specification [RFC2131] provides a way to specify the
   client link-layer address in the DHCPv4 message header.  A DHCPv4
   message header has 'htype' and 'chaddr' fields to specify the client
   link-layer address type and the link-layer address, respectively.
   The 'chaddr' field is used both as a hardware address for
   transmission of reply messages and as a client identifier.

   The 'requested IP address' option [RFC2131] is used by client to
   suggest that a particular IP address be assigned.

3.3.  Subscriber-ID Option

   A DHCP relay includes a Subscriber-ID option [RFC3993] to associate
   some provider-specific information with clients' DHCP messages that
   is independent of the physical network configuration through which
   the subscriber is connected.

   The "subscriber-id" assigned by the provider is intended to be stable
   as customers connect through different paths, and as network changes
   occur.  The Subscriber-ID is an ASCII string, which is assigned and
   configured by the network provider.

3.4.  Relay Agent Information Option and Sub-options

   A DHCP relay agent includes a Relay Agent Information [RFC3046] to
   identify the remote host end of the circuit.  It contains a "circuit
   ID" sub-option for the incoming circuit, which is an agent-local
   identifier of the circuit from which a DHCP client-to-server packet
   was received, and a "remote ID" sub-option which provides a trusted
   identifier for the remote high-speed modem.



Jiang, et al.            Expires April 30, 2015                 [Page 4]

Internet-Draft         DHCP Privacy considerations          October 2014


   Possible encoding of "circuit ID" sub-option includes: router
   interface number, switching hub port number, remote access server
   port number, frame relay DLCI, ATM virtual circuit number, cable data
   virtual circuit number, etc.

   Possible encoding of the "remote ID" sub-option includes: a "caller
   ID" telephone number for dial-up connection, a "user name" prompted
   for by a remote access server, a remote caller ATM address, a "modem
   ID" of a cable data modem, the remote IP address of a point-to-point
   link, a remote X.25 address for X.25 connections, etc.

   The link-selection sub-option [RFC3527] is used by any DHCP relay
   agent that desires to specify a subnet/link for a DHCP client request
   that it is relaying but needs the subnet/link specification to be
   different from the IP address the DHCP server should use when
   communicating with the relay agent.  It contains an IP address, which
   can identify the client's subnet/link.

3.5.  Client FQDN Option

   The Client Fully Qualified Domain Name (FQDN) option [RFC4702] is
   used by DHCP clients and servers to exchange information about the
   client's fully qualified domain name and about who has the
   responsibility for updating the DNS with the associated AAAA and PTR
   RRs.

   A client can use this option to convey all or part of its domain name
   to a DHCP server for the IP-address-to-FQDN mapping.  In most case a
   client sends its hostname as a hint for the server.  The DHCP server
   MAY be configured to modify the supplied name or to substitute a
   different name.  The server should send its notion of the complete
   FQDN for the client in the Domain Name field.

3.6.  Parameter Request List Option

   The Parameter Request List option [RFC2131] is used to inform the
   server about options the client wants the server to send to the
   client.  The content of a Parameter Request List option are the
   option codes for an option requested by the client.

3.7.  Vendor Class and Vendor-Identifying Vendor Class Options

   The Vendor Class option [RFC2131] and the Vendor-Identifying Vendor
   Class option [RFC3925] is used by a DHCP client to identify the
   vendor that manufactured the hardware on which the client is running.

   The information contained in the data area of this option is
   contained in one or more opaque fields that identify the details of



Jiang, et al.            Expires April 30, 2015                 [Page 5]

Internet-Draft         DHCP Privacy considerations          October 2014


   the hardware configuration of the host on which the client is
   running, or of industry consortium compliance, for example, the
   version of the operating system the client is running or the amount
   of memory installed on the client.

3.8.  Civic Location Option

   DHCP servers use the Civic Location Option [RFC4776] to delivery of
   the location information (the civic and postal addresses) to the DHCP
   clients.  It may refer to three locations: the location of the DHCP
   server, the location of the network element believed to be closest to
   the client, or the location of the client, identified by the "what"
   element within the option.

3.9.  Coordinate-Based Location Option

   The GeoConf and GeoLoc options [RFC6225] is used by DHCP server to
   provide the coordinate-based geographic location information to the
   DHCP clients.  It enables a DHCP client to obtain its geographic
   location.

   After the relevant DHCP exchanges have taken place, the location
   information is stored on the end device rather than somewhere else,
   where retrieving it might be difficult in practice.

3.10.  Client System Architecture Type Option

   The Client System Architecture Type Option [RFC4578] is used by DHCP
   client to send a list of supported architecture types to the DHCP
   server.  It is used to provide configuration information for a node
   that must be booted using the network rather than from local storage.

4.  Existing Mechanisms That Affect Privacy

   This section describes available DHCP mechanisms that one can use to
   protect or enhance one's privacy.

4.1.  DNS Updates

   DNS Updates [RFC4704] defines a mechanism that allows both clients
   and server to insert into DNS domain information about clients.  Both
   forward (AAAA) and reverse (PTR) resource records can be updated.
   This allows other nodes to conveniently refer to a host, despite the
   fact that its IP address may be changing.

   This mechanism exposes two important pieces of information: current
   address (which can be mapped to current location) and client's
   hostname.  The stable hostname can then by used to correlate the



Jiang, et al.            Expires April 30, 2015                 [Page 6]

Internet-Draft         DHCP Privacy considerations          October 2014


   client across different network attachments even when its IP
   addresses keep changing.

4.2.  Allocation strategies

   A DHCP server running in typical, stateful mode is given a task of
   managing one or more pools of IP address resources.  When a client
   requests a resource, server must pick a resource out of configured
   pool.  Depending on the server's implementation, various allocation
   strategies are possible.  Choices in this regard may have privacy
   implications.

   Iterative allocation - a server may choose to allocate addresses one
   by one.  That strategy has the benefit of being very fast, thus can
   be favored in deployments that prefer performance.  However, it makes
   the resources very predictable.  Also, since the resources allocated
   tend to be clustered at the beginning of available pool, it makes
   scanning attacks much easier.

   Identifier-based allocation - a server may choose to allocate an
   address that is based on one of available identifiers, e.g. client
   identifier or MAC address.  It is also convenient, as returning
   client is very likely to get the same address.  Those properties are
   convenient for system administrators, so DHCP server implementors are
   often requested to implement it.  On the other hand, the downside of
   such allocation is that the client has a very stable IP address.
   That means that correlation of activities over time, location
   tracking, address scanning and OS/vendor discovery apply.

   Hash allocation - it's an extension of identifier based allocation.
   Instead of using the identifier directly, it is being hashed first.
   If the hash is implemented correctly, it removes the flaw of
   disclosing the identifier, a property that eliminates susceptibility
   to address scanning and OS/vendor discovery.  If the hash is poorly
   implemented (e.g. can be reverted), it introduces no improvement over
   identifier-based allocation.

   Random allocation - a server can pick a resource randomly out of
   available pool.  That strategy works well in scenarios where pool
   utilization is small, as the likelihood of collision (resulting in
   the server needing to repeat randomization) is small.  With the pool
   allocation increasing, the collision is disproportionally large, due
   to birthday paradox.  With high pool utilization (e.g. when 90% of
   available resources being allocated already), the server will use
   most computational resources to repeatedly pick a random resource,
   which will degrade its performance.  This allocation scheme
   essentially prevents returning clients from getting the same address
   again.  On the other hand, it is beneficial from privacy perspective



Jiang, et al.            Expires April 30, 2015                 [Page 7]

Internet-Draft         DHCP Privacy considerations          October 2014


   as addresses generated that way are not susceptible to correlation
   attacks, OS/vendor discovery attacks or identity discovery attacks.
   Note that even though the address itself may be resilient to a given
   attack, the client may still be susceptible if additional information
   is disclosed other way, e.g. client's address can be randomized, but
   it still can leak its MAC address in client-id option.

   Other allocation strategies may be implemented.

   However, giving the limited resource of IPv4 public address pool,
   allocation mechanism in IPv4 may not provide much protection, while
   in IPv6, the network has very large address space to distribute the
   address allocation.

5.  Attacks

5.1.  Device type discovery

   The type of device used by the client can be guessed by the attacker
   using the Vendor Class Option, the 'chaddr' field, and by parsing the
   Client ID Option.  All of those options may contain OUI
   (Organizationally Unique Identifier) that represents the device's
   vendor.  That knowledge can be used for device-specific vulnerability
   exploitation attacks.

5.2.  Operating system discovery

   The operating system running on a client can be guessed using the
   Vendor Class option, the Client System Architecture Type option, or
   by using fingerprinting techniques on the combination of options
   requested using the Parameter Request List option.

5.3.  Finding location information

   The location information can be obtained by the attacker by many
   means.  The most direct way to obtain this information is by looking
   into a server initiated message that contains the Civic Location,
   GeoConf, or GeoLoc options.  It can also be indirectly inferred using
   the Relay Agent Information option, with the remote ID sub-option
   (e.g. using a telephone number), the circuit ID option (e.g. if an
   access circuit on an Access Node corresponds to a civic location), or
   the Subscriber ID Option (if the attacker has access to subscriber
   info).








Jiang, et al.            Expires April 30, 2015                 [Page 8]

Internet-Draft         DHCP Privacy considerations          October 2014


5.4.  Finding previously visited networks

   When DHCP clients connect to a network, they attempt to obtain the
   same address they had used before they attached to the network.  They
   do this by putting the previously assigned address in the requested
   IP address option.  By observing these addresses, an attacker can
   identify the network the client had previously visited.

5.5.  Finding a stable identity

   An attacker might use a stable identity gleaned from DHCP messages to
   correlate activities of a given client on unrelated networks.  The
   Client FQDN option, the Subscriber ID Option and the Client ID
   options can serve as long lived identifiers of DHCP clients.  The
   Client FQDN option can also provide an identity that can easily be
   correlated with web server activity logs.

5.6.  Pervasive monitoring

   This is an enhancement, or a combination of most aforementioned
   mechanisms.  Operator who controls non-trivial number of access
   points or network segments, may use obtained information about a
   single client and observer client's habits.

5.7.  Finding client's IP address or hostname

   Many DHCP deployments use DNS Updates [RFC4702] that put client's
   information (current IP address, client's hostname).  Client ID is
   also disclosed, able it in not easily accessible form (SHA-256 digest
   of the client-id).  Although SHA-256 is irreversible, so DHCID can't
   be converted back to client-id.  However, SHA-256 digest can be used
   as a unique identifier that is accessible by any host.

5.8.  Correlation of activities over time

   As with other identifiers, an IP address can be used to correlate the
   activities of a host for at least as long as the lifetime of the
   address.  If that address was generated from some other, stable
   identifier and that generation scheme can be deducted by an attacker,
   the duration of correlation attack extends to that identifier.  In
   many cases, its lifetime is equal to the lifetime of the device
   itself.

5.9.  Location tracking

   If a stable identifier is used for assigning an address and such
   mapping is discovered by an attacker.  In particular both passive (a
   service that the client connects to can log client's address and draw



Jiang, et al.            Expires April 30, 2015                 [Page 9]

Internet-Draft         DHCP Privacy considerations          October 2014


   conclusions regarding its location and movement patterns based on
   address it is connecting from) and active (attacker can send ICMP
   echo requests or other probe packets to networks of suspected client
   locations).

5.10.  Leasequery & bulk leasequery

   Attackers may pretend as an access concentrator, either DHCP relay
   agent or DHCP client, to obtain location information directly from
   the DHCP server(s) using the DHCP Leasequery [RFC4388], [RFC6148]
   mechanism.

   Location information is information needed by the access concentrator
   to forward traffic to a broadband-accessible host.  This information
   includes knowledge of the host hardware address, the port or virtual
   circuit that leads to the host, and/or the hardware address of the
   intervening subscriber modem.

   Furthermore, the attackers may use DHCP bulk leasequery [RFC6926]
   mechanism to obtain bulk information about DHCP bindings, even
   without knowing the target bindings.

6.  Security Considerations

   TBD

7.  Privacy Considerations

   This document at its entirety discusses privacy considerations in
   DHCP.  As such, no separate section about this is needed.

8.  IANA Considerations

   This draft does not request any IANA action.

9.  Acknowledgements

   The authors would like to thanks the valuable comments made by
   Stephen Farrell, Ted Lemon, Ines Robles, Russ White, Christian
   Schaefer and other members of DHC WG.

   This document was produced using the xml2rfc tool [RFC2629].

10.  References







Jiang, et al.            Expires April 30, 2015                [Page 10]

Internet-Draft         DHCP Privacy considerations          October 2014


10.1.  Normative References

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119, March 1997.

   [RFC2131]  Droms, R., "Dynamic Host Configuration Protocol", RFC
              2131, March 1997.

   [RFC3046]  Patrick, M., "DHCP Relay Agent Information Option", RFC
              3046, January 2001.

   [RFC3527]  Kinnear, K., Stapp, M., Johnson, R., and J. Kumarasamy,
              "Link Selection sub-option for the Relay Agent Information
              Option for DHCPv4", RFC 3527, April 2003.

   [RFC3925]  Littlefield, J., "Vendor-Identifying Vendor Options for
              Dynamic Host Configuration Protocol version 4 (DHCPv4)",
              RFC 3925, October 2004.

   [RFC3993]  Johnson, R., Palaniappan, T., and M. Stapp, "Subscriber-ID
              Suboption for the Dynamic Host Configuration Protocol
              (DHCP) Relay Agent Option", RFC 3993, March 2005.

   [RFC4361]  Lemon, T. and B. Sommerfeld, "Node-specific Client
              Identifiers for Dynamic Host Configuration Protocol
              Version Four (DHCPv4)", RFC 4361, February 2006.

   [RFC4388]  Woundy, R. and K. Kinnear, "Dynamic Host Configuration
              Protocol (DHCP) Leasequery", RFC 4388, February 2006.

   [RFC4702]  Stapp, M., Volz, B., and Y. Rekhter, "The Dynamic Host
              Configuration Protocol (DHCP) Client Fully Qualified
              Domain Name (FQDN) Option", RFC 4702, October 2006.

   [RFC4704]  Volz, B., "The Dynamic Host Configuration Protocol for
              IPv6 (DHCPv6) Client Fully Qualified Domain Name (FQDN)
              Option", RFC 4704, October 2006.

   [RFC4776]  Schulzrinne, H., "Dynamic Host Configuration Protocol
              (DHCPv4 and DHCPv6) Option for Civic Addresses
              Configuration Information", RFC 4776, November 2006.

   [RFC6148]  Kurapati, P., Desetti, R., and B. Joshi, "DHCPv4 Lease
              Query by Relay Agent Remote ID", RFC 6148, February 2011.

   [RFC6225]  Polk, J., Linsner, M., Thomson, M., and B. Aboba, "Dynamic
              Host Configuration Protocol Options for Coordinate-Based
              Location Configuration Information", RFC 6225, July 2011.



Jiang, et al.            Expires April 30, 2015                [Page 11]

Internet-Draft         DHCP Privacy considerations          October 2014


   [RFC6926]  Kinnear, K., Stapp, M., Desetti, R., Joshi, B., Russell,
              N., Kurapati, P., and B. Volz, "DHCPv4 Bulk Leasequery",
              RFC 6926, April 2013.

10.2.  Informative References

   [RFC2629]  Rose, M., "Writing I-Ds and RFCs using XML", RFC 2629,
              June 1999.

   [RFC4578]  Johnston, M. and S. Venaas, "Dynamic Host Configuration
              Protocol (DHCP) Options for the Intel Preboot eXecution
              Environment (PXE)", RFC 4578, November 2006.

   [RFC6973]  Cooper, A., Tschofenig, H., Aboba, B., Peterson, J.,
              Morris, J., Hansen, M., and R. Smith, "Privacy
              Considerations for Internet Protocols", RFC 6973, July
              2013.

Authors' Addresses

   Sheng Jiang
   Huawei Technologies Co., Ltd
   Q14, Huawei Campus, No.156 Beiqing Road
   Hai-Dian District, Beijing, 100095
   P.R. China

   Email: jiangsheng@huawei.com


   Suresh Krishnan
   Ericsson
   8400 Decarie Blvd.
   Town of Mount Royal, QC
   Canada

   Phone: +1 514 345 7900 x42871
   Email: suresh.krishnan@ericsson.com


   Tomek Mrugalski
   Internet Systems Consortium, Inc.
   950 Charter Street
   Redwood City, CA  94063
   USA

   Phone: +1 650 423 1345
   Email: tomasz.mrugalski@gmail.com




Jiang, et al.            Expires April 30, 2015                [Page 12]