pranaypourkar
- Jul 23, 2023

Exploring KDF - Key Derivation Function

Table of Contents

Commonly used Key Derivation Function (KDF) algorithm

Properties which make KDF algorithms effective and reliable

What makes KDFs Weak or Strong?

Hands-on with Java and Spring Boot

Common use case of user password storing

Security is very important, especially when dealing with sensitive data and information like passwords, secrets, etc. One of the aspects of a cryptographic system is the generation of strong cryptographic keys which act as a foundation for securing data, ensuring confidentiality, integrity, and authenticity. In this blog, let's explore KDF.

A Key Derivation Function (KDF) is a cryptographic algorithm that takes an input, usually a password or a passphrase, and transforms it into a cryptographic key suitable for use in various security protocols and applications.

It is commonly used in scenarios where a secret key needs to be derived from a human-readable password, which typically has limited entropy and may be vulnerable to brute-force or dictionary attacks. A KDF cannot add entropy that the password does not have, but it makes every guess expensive to compute and produces uniformly distributed output of the required length, so the resulting key is far more resistant to these types of attacks.

Entropy measures the unpredictability or randomness of a given set of data. It quantifies the amount of randomness present in a system or source of data.
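To make the idea of limited entropy concrete, here is a rough sketch that computes the upper bound on entropy for a secret whose characters are picked uniformly at random. The class and method names are hypothetical, and the formula (length times log2 of the alphabet size) assumes truly random choices; passwords chosen by people usually carry far less.

```java
// Hypothetical helper for illustration; not part of any library.
final class EntropyEstimate {

    /** Upper bound, in bits, for `length` symbols picked uniformly at random from `alphabetSize` choices. */
    static double maxEntropyBits(int alphabetSize, int length) {
        return length * (Math.log(alphabetSize) / Math.log(2));
    }

    public static void main(String[] args) {
        // Compare typical password shapes with a random 128-bit key
        System.out.println("8 lowercase letters:        " + maxEntropyBits(26, 8) + " bits");
        System.out.println("11 alphanumeric characters: " + maxEntropyBits(62, 11) + " bits");
        System.out.println("random 128-bit key:         " + maxEntropyBits(2, 128) + " bits");
    }
}
```

Even under this generous assumption, a short password stays well below the entropy of a randomly generated key, which is why a KDF is used to raise the cost of each guess.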

Key Derivation Functions (KDFs) are one-way functions. A one-way function is a mathematical operation or algorithm that is easy to compute in one direction (forward computation) but computationally infeasible to reverse, that is, to recover the original input from the output (backward computation).

Some of the commonly used Key Derivation Function (KDF) algorithms are given below.

PBKDF2 (Password-Based Key Derivation Function 2) -> PBKDF2 is a widely used KDF that applies a pseudorandom function, typically an HMAC over a hash such as SHA-1 or SHA-256, many times to derive a key from a password. The iteration count and the salt are configurable.
bcrypt -> bcrypt is a password hashing function built on the key setup of the Blowfish cipher. Its cost factor controls how many times the expensive key setup is repeated, which slows down brute-force attacks.
scrypt -> scrypt is a KDF designed to be memory-hard. Because every evaluation needs a large amount of memory, running many guesses in parallel on GPUs or custom hardware becomes expensive.
Argon2 -> Argon2 is a modern, memory-hard KDF that offers resistance against various attacks, including brute-force, time-memory trade-off, and, depending on the variant used, side-channel attacks.
HKDF (HMAC-based Key Derivation Function) -> HKDF is a key derivation function based on HMAC. It allows for the derivation of keys of varying lengths and is commonly used for generating cryptographic keys from shared secrets. It performs no key stretching, so it is not meant for passwords.
KDF1 and KDF2 -> KDF1 and KDF2 are simple Key Derivation Functions based on a cryptographic hash function, defined in standards such as ISO/IEC 18033-2. They are used for deriving keys from a shared secret rather than from passwords, since they provide no key stretching.
Catena -> Catena is a password-hashing scheme designed to be memory-hard and resistant to parallel attacks. It incorporates a chaining mechanism and introduces a time cost factor to increase the computational difficulty.
Balloon Hashing -> Balloon Hashing is a memory-hard KDF that aims to provide resistance against both parallel and time-memory trade-off attacks. It is designed to consume a significant amount of memory during the key derivation process.

Memory-hard refers to an algorithm or function that requires a significant amount of memory (RAM) to compute, making it computationally expensive for attackers attempting to parallelize or optimize the computation.
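As a worked example of what memory-hard means in practice, the sketch below estimates the size of scrypt's main working array, which is roughly 128 * N * r bytes (N is the CPU/memory cost and r the block size). The class and method names are hypothetical, and the figure ignores smaller buffers, so treat it as an approximation rather than an exact measurement.

```java
// Hypothetical helper for illustration; not part of any library.
final class ScryptMemory {

    /** Approximate size in bytes of scrypt's main working array: 128 * N * r. */
    static long approxBytes(long n, long r) {
        return 128L * n * r;
    }

    public static void main(String[] args) {
        // N = 8, r = 8 are the values passed to SCrypt.generate in the Bouncy Castle sample of this post
        System.out.println("N=8, r=8     -> " + approxBytes(8, 8) + " bytes");
        // N = 16384, r = 8 is a commonly cited setting for interactive logins
        System.out.println("N=16384, r=8 -> " + approxBytes(16384, 8) + " bytes");
    }
}
```

The first setting needs only about 8 KiB per guess, while the second needs about 16 MiB, which is what makes large-scale parallel guessing costly.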

Some of the properties which make KDF algorithms effective and reliable in securely deriving cryptographic keys are listed below.

Key Strengthening -> KDFs must employ techniques like key stretching or iterative hashing to increase the computational effort required to derive the key. This property helps slow down potential attackers attempting to guess the input by brute force or dictionary attacks.
Salt or Nonce Usage -> KDFs often use a salt or a nonce to derive the key. A salt is a random value that is combined with the input to prevent precomputed attacks like rainbow table attacks. A nonce, a number used once, is a unique value used to ensure that each key derivation operation produces a different output, even if the input is the same.
Domain Separation -> KDFs should ensure that the keys derived from different inputs or for different purposes are distinct and do not collide. This prevents issues where keys derived from unrelated inputs accidentally match, compromising the security of the system.
Randomness Preservation -> KDFs must try to keep as much unpredictability and randomness as possible from the original secret value (like a password) when generating a new cryptographic key. This is important because the security of the resulting key depends on the quality and randomness of the original secret. By preserving the randomness, KDFs ensure that the derived key is strong and less susceptible to attacks, making it safer for encrypting and protecting sensitive data.
Resistance to Attacks -> KDFs should resist various cryptographic attacks, such as brute force attacks, dictionary attacks, and precomputed attacks. The algorithms should provide a high level of security and make it computationally infeasible for an attacker to derive the input or recover the original key from the derived key.
Efficiency -> While security is critical, efficient computation is also important for practical implementation and performance. KDF algorithms should maintain a balance between security and performance, ensuring that key derivation operations can be carried out in a reasonable amount of time without excessive computational resources.
Standardization -> KDF algorithms should be well-defined and standardized to ensure interoperability and ease of implementation across different systems and applications.
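The key strengthening and salt properties above can be observed with the PBKDF2 implementation that ships with the JDK. This is a minimal sketch: the class name, helper names, and the iteration count of 10,000 are illustrative assumptions chosen to keep the demo fast, and real deployments typically use much higher counts tuned to their hardware.

```java
import java.security.GeneralSecurityException;
import java.security.SecureRandom;
import java.util.Arrays;
import javax.crypto.SecretKeyFactory;
import javax.crypto.spec.PBEKeySpec;

// Hypothetical demo class; only the JDK's built-in PBKDF2 is used.
final class SaltDemo {

    /** Derives a key of keyBits bits with PBKDF2-HMAC-SHA256. */
    static byte[] derive(char[] password, byte[] salt, int iterations, int keyBits) {
        PBEKeySpec spec = new PBEKeySpec(password, salt, iterations, keyBits);
        try {
            return SecretKeyFactory.getInstance("PBKDF2WithHmacSHA256")
                    .generateSecret(spec)
                    .getEncoded();
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException("PBKDF2 is not available", e);
        } finally {
            spec.clearPassword(); // wipes the spec's internal copy of the password
        }
    }

    /** Generates a fresh random salt of the given length. */
    static byte[] randomSalt(int length) {
        byte[] salt = new byte[length];
        new SecureRandom().nextBytes(salt);
        return salt;
    }

    public static void main(String[] args) {
        char[] password = "Password123".toCharArray();
        int iterations = 10_000; // illustrative value only
        byte[] saltA = randomSalt(16);
        byte[] saltB = randomSalt(16);

        byte[] keyA1 = derive(password, saltA, iterations, 256);
        byte[] keyA2 = derive(password, saltA, iterations, 256);
        byte[] keyB = derive(password, saltB, iterations, 256);

        // Same password and salt reproduce the key; a different salt changes it
        System.out.println("same salt gives same key:      " + Arrays.equals(keyA1, keyA2));
        System.out.println("different salt gives same key: " + Arrays.equals(keyA1, keyB));
    }
}
```

Because the salt changes the output, a table of hashes precomputed for common passwords is useless against salted entries, and the iteration count multiplies the work an attacker must spend on every guess.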

Let's understand what makes KDFs Weak or Strong.

1. Weak KDF

A weak KDF is a Key Derivation Function that lacks sufficient security properties or fails to provide adequate protection against attacks. Weak KDFs may have vulnerabilities that can be exploited by attackers, potentially compromising the security of the derived keys. Some of the characteristics of weak KDFs include -

Lack of randomness preservation
Insufficient key stretching
Susceptibility to attacks
Non-standardized or obsolete designs

For example, a KDF based on a single pass of MD5 is weak. MD5 (Message Digest Algorithm 5) is a widely known cryptographic hash function that has been found to have vulnerabilities. It is considered insecure for most cryptographic applications due to its susceptibility to collision attacks, and it is also extremely fast to compute, which lets an attacker test huge numbers of candidate passwords cheaply. Using MD5 as a basis for key derivation would make the resulting keys weak and vulnerable to various attacks.
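The weakness is easy to see in code. The sketch below, with a hypothetical class and method name, hashes a password with a single unsalted MD5 pass using the JDK's MessageDigest. It is shown only as an example of what to avoid.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical demo of an approach to AVOID: one fast, unsalted hash pass.
final class WeakHashDemo {

    /** Returns the lowercase hex MD5 digest of the UTF-8 bytes of the input. */
    static String md5Hex(String input) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5")
                    .digest(input.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 is not available", e);
        }
    }

    public static void main(String[] args) {
        // No salt and no iterations: the same password always maps to the same digest,
        // so two users with the same password are indistinguishable in the database
        System.out.println(md5Hex("password"));
        System.out.println(md5Hex("password"));
    }
}
```

Since the output is identical for every user with the same password and costs almost nothing to compute, precomputed lookup tables and brute-force searches work directly against it.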

2. Strong KDF

A strong KDF is a Key Derivation Function that possesses strong security properties and is designed to resist various cryptographic attacks. Strong KDFs employ well-established cryptographic techniques and have undergone extensive analysis and scrutiny by the cryptographic community. Some of the characteristics of strong KDFs include -

Key stretching and computational effort
Randomness preservation
Resistance to attacks
Standardization and scrutiny

For example, PBKDF2, bcrypt, and Argon2 are considered to be strong KDFs. PBKDF2 (Password-Based Key Derivation Function 2) is widely adopted for password storage and key derivation purposes. PBKDF2 applies a pseudorandom function (such as HMAC-SHA1, HMAC-SHA256, or HMAC-SHA512) iteratively to derive a cryptographic key from a password or passphrase. It also incorporates salting to prevent precomputed attacks like rainbow table attacks.

Algorithm	Relative strength	Relative speed
BCrypt	Medium	Slow
SCrypt	High	Very Slow
PBKDF2	Low	Medium
Argon2	Very High	Very Slow

Difference between a KDF and a hash function?

KDFs and hash functions are both cryptographic algorithms, but they serve different purposes and have distinct characteristics.

> Purpose
KDF: The primary purpose of a KDF is to derive cryptographic keys or key material from an input, such as a password or shared secret. Password-based KDFs are specifically designed to turn low-entropy inputs into keys that are costly to guess and suitable for use in cryptographic operations.
Hash Function: Hash functions are designed to take an arbitrary input and produce a fixed-size output, called a hash or message digest. Their primary purpose is to verify data integrity, detect changes or tampering in the input, and generate a compact identifier (digest) for a given input.

> Input and Output
KDF: KDFs usually take a secret input, such as a password or shared secret, and produce a cryptographic key or key material as the output. The output can have variable length and is generally used for encryption, authentication, or other cryptographic purposes.
Hash Function: Hash functions take any input, such as a message or data, and produce a fixed-size output (hash). The output acts as a fingerprint of the input (collisions exist in principle but should be infeasible to find) and is commonly used for data integrity verification and digital signatures.

> Key Stretching
KDF: KDFs often include techniques like key stretching, where the input is subjected to multiple iterations or computations to increase the computational effort required to derive the key. Key stretching makes brute-force attacks on the input more expensive.
Hash Function: While some hash functions are used as building blocks in password-based key derivation, they are not inherently designed for key stretching. Hash functions aim to be fast while providing collision resistance and data integrity.

Some of the use cases for applying KDF are given below.

Key Generation -> KDFs are commonly used to generate cryptographic keys from a master key or password. This process ensures that the generated keys have sufficient entropy and are resistant to various attacks.
Password Hashing -> KDFs play a crucial role in password-based key derivation. They derive a secure and unique cryptographic hash from a user's password, adding salt and applying multiple iterations to protect against brute force and dictionary attacks.
Key Expansion -> KDFs can expand a single key into multiple keys or produce additional parameters needed for cryptographic algorithms. This allows for efficient and secure key management within cryptographic systems.
Deriving Initialization Vectors (IVs) -> KDFs can derive initialization vectors used in symmetric encryption algorithms. IVs are crucial for ensuring the uniqueness and randomness of ciphertexts, contributing to the security of encrypted data.
Key Agreement -> In key agreement protocols such as Diffie-Hellman, a KDF is applied to the raw shared secret that the parties compute. The derived output can then be used as a symmetric encryption key or as input to other cryptographic algorithms.
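The key expansion and key agreement use cases can be sketched with HKDF, which first extracts a pseudorandom key from the input and then expands it into as many output bytes as needed. The code below is a minimal illustration built on the JDK's HmacSHA256, following the extract-then-expand structure of RFC 5869. The class name, the stand-in shared secret, and the "encryption" and "authentication" labels are assumptions made for the example; for production use, a vetted library implementation is preferable.

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import java.util.Arrays;
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;

// Hypothetical sketch of HKDF with HMAC-SHA256; not a hardened implementation.
final class HkdfSketch {
    private static final String HMAC = "HmacSHA256";
    private static final int HASH_LEN = 32;

    /** Computes HMAC-SHA256 over the concatenation of the given parts. */
    static byte[] hmac(byte[] key, byte[]... parts) {
        try {
            Mac mac = Mac.getInstance(HMAC);
            mac.init(new SecretKeySpec(key, HMAC));
            for (byte[] part : parts) {
                mac.update(part);
            }
            return mac.doFinal();
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException("HMAC-SHA256 is not available", e);
        }
    }

    /** Extract step: condenses the input keying material into a pseudorandom key. */
    static byte[] extract(byte[] salt, byte[] inputKeyMaterial) {
        byte[] key = (salt == null || salt.length == 0) ? new byte[HASH_LEN] : salt;
        return hmac(key, inputKeyMaterial);
    }

    /** Expand step: stretches the pseudorandom key to `length` bytes bound to `info`. */
    static byte[] expand(byte[] pseudoRandomKey, byte[] info, int length) {
        if (length < 0 || length > 255 * HASH_LEN) {
            throw new IllegalArgumentException("requested length out of range");
        }
        byte[] output = new byte[length];
        byte[] previous = new byte[0];
        int written = 0;
        for (int counter = 1; written < length; counter++) {
            // T(i) = HMAC(PRK, T(i-1) || info || i)
            previous = hmac(pseudoRandomKey, previous, info, new byte[] {(byte) counter});
            int chunk = Math.min(previous.length, length - written);
            System.arraycopy(previous, 0, output, written, chunk);
            written += chunk;
        }
        return output;
    }

    public static void main(String[] args) {
        // Stand-in for a secret produced by a key agreement protocol
        byte[] sharedSecret = "example shared secret".getBytes(StandardCharsets.UTF_8);
        byte[] prk = extract(null, sharedSecret);

        // Different info labels yield independent keys from the same secret
        byte[] encKey = expand(prk, "encryption".getBytes(StandardCharsets.UTF_8), 32);
        byte[] macKey = expand(prk, "authentication".getBytes(StandardCharsets.UTF_8), 32);
        System.out.println("keys differ: " + !Arrays.equals(encKey, macKey));
    }
}
```

Binding each derived key to a distinct label is also one way to obtain the domain separation property mentioned earlier.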

It's time to see some hands-on with Java and Spring Boot.

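Using Spring Crypto spring-security-crypto

Note that SCryptPasswordEncoder and Argon2PasswordEncoder rely on Bouncy Castle at runtime, so the org.bouncycastle dependency shown later in this post also needs to be on the classpath.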

<dependency>
    <groupId>org.springframework.security</groupId>
    <artifactId>spring-security-crypto</artifactId>
    <version>5.7.5</version>
</dependency>

package com.company.project;

import lombok.extern.slf4j.Slf4j;
import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.security.crypto.bcrypt.BCryptPasswordEncoder;
import org.springframework.security.crypto.password.Pbkdf2PasswordEncoder;
import org.springframework.security.crypto.argon2.Argon2PasswordEncoder;
import org.springframework.security.crypto.scrypt.SCryptPasswordEncoder;

@Slf4j
@SpringBootApplication
public class Application {

    public static void main(final String[] args) {
        SpringApplication.run(Application.class, args);

        // Password to be encoded 
        String password = "Password123";

        // Using BCrypt Algorithm
        BCryptPasswordEncoder bcryptEncoder = new BCryptPasswordEncoder();
        String bcryptEncodePassword = bcryptEncoder.encode(password);
        log.info("bcrypt Encoded Password: {}", bcryptEncodePassword);

        // Using SCrypt Algorithm
        SCryptPasswordEncoder scryptEncoder = new SCryptPasswordEncoder();
        String scryptEncodePassword = scryptEncoder.encode(password);
        log.info("scrypt Encoded Password: {}", scryptEncodePassword);

        // Using Pbkdf2 Algorithm
        Pbkdf2PasswordEncoder pbkdf2Encoder = new Pbkdf2PasswordEncoder();
        String pbkdf2EncodePassword = pbkdf2Encoder.encode(password);
        log.info("pbkdf2 Encoded Password: {}", pbkdf2EncodePassword);

        // Using Argon2 Algorithm
        Argon2PasswordEncoder argon2Encoder = new Argon2PasswordEncoder();
        String argon2EncodePassword = argon2Encoder.encode(password);
        log.info("argon2 Encoded Password: {}", argon2EncodePassword);
    }
}

Result of sample KDF code using spring-security-crypto

Using Bouncycastle org.bouncycastle

<dependency>
    <groupId>org.bouncycastle</groupId>
    <artifactId>bcpkix-jdk15on</artifactId>
    <version>1.67</version>
</dependency>

package com.company.project;

import java.nio.charset.StandardCharsets;
import java.security.SecureRandom;
import java.util.Base64;
import lombok.extern.slf4j.Slf4j;
import org.bouncycastle.crypto.generators.BCrypt;
import org.bouncycastle.crypto.generators.SCrypt;
import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;

@Slf4j
@SpringBootApplication
public class Application {

    public static void main(final String[] args) {
        SpringApplication.run(Application.class, args);

        // Password to be encoded, converted to bytes with an explicit charset
        byte[] passwordBytes = "Password123".getBytes(StandardCharsets.UTF_8);
        SecureRandom secureRandom = new SecureRandom();

        // Using BCrypt Algorithm: 16-byte (128-bit) random salt and a cost factor of 8
        byte[] bcryptSalt = new byte[16];
        secureRandom.nextBytes(bcryptSalt);
        byte[] bcryptEncodedPasswordBytes = BCrypt.generate(passwordBytes, bcryptSalt, 8);
        log.info("bcrypt Encoded Password: {}", Base64.getUrlEncoder().encodeToString(bcryptEncodedPasswordBytes));

        // Using SCrypt Algorithm: N = 8, r = 8, p = 1 and a 10-byte output.
        // These values keep the demo fast; they are far too small for real use.
        byte[] scryptSalt = new byte[32];
        secureRandom.nextBytes(scryptSalt);
        byte[] scryptEncodePasswordBytes = SCrypt.generate(passwordBytes, scryptSalt, 8, 8, 1, 10);
        log.info("scrypt Encoded Password: {}", Base64.getUrlEncoder().encodeToString(scryptEncodePasswordBytes));
    }
}

Result of sample KDF code using Bouncycastle

Let's see a common use case of user password storing.

It is considered a bad practice to store plain passwords in any system or database. Storing passwords in plain text leaves them vulnerable to unauthorized access in case of a security breach or database compromise.

Instead of being stored as plain text, passwords should always be securely hashed or processed using a Key Derivation Function (KDF) before being stored.

Using a KDF to hash passwords is a common and recommended practice. KDFs, like PBKDF2, bcrypt, and Argon2, are specifically designed for password hashing and key derivation.

When a user creates or updates their password, the password is passed through the KDF, which applies a one-way cryptographic process to generate a secure hash. This hashed version of the password is then stored in the database, not the actual plain password.
When a user attempts to log in, the provided password is hashed using the same KDF, salt, and parameters that produced the stored hash, and the result is compared against it. If the hashes match, the password is correct, and the user is granted access. Importantly, since KDFs are one-way functions, the original password cannot be easily derived from the stored hash, providing an extra layer of security.

Sample code to compare a user's input password with the stored hash value.

package com.company.project;

import lombok.extern.slf4j.Slf4j;
import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.security.crypto.argon2.Argon2PasswordEncoder;
import org.springframework.security.crypto.password.PasswordEncoder;

@Slf4j
@SpringBootApplication
public class Application {

    public static void main(final String[] args) {
        SpringApplication.run(Application.class, args);

        // Two identical sample passwords: each encode() call uses a fresh random salt,
        // so the two hashes below differ even though the inputs are the same
        String password1 = "Password123";
        String password2 = "Password123";
        
        // Create an instance of Argon2PasswordEncoder
        PasswordEncoder passwordEncoder = new Argon2PasswordEncoder();
        
        // Hashing passwords
        String hashedPassword1 = passwordEncoder.encode(password1);
        String hashedPassword2 = passwordEncoder.encode(password2);
        
        // Print the hashed passwords
        log.info("Hashed Password 1: {}", hashedPassword1);
        log.info("Hashed Password 2: {}", hashedPassword2);
        
        // Verify passwords: matches() reads the salt and parameters from the stored hash
        boolean password1Matches = passwordEncoder.matches(password1, hashedPassword1);
        boolean password2Matches = passwordEncoder.matches(password2, hashedPassword2);
        
        log.info("Password 1 Matches: {}", password1Matches);
        log.info("Password 2 Matches: {}", password2Matches);
    }
}

Result of sample KDF code for user password matching using spring-security-crypto
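For readers who want to see what happens behind encode and matches, here is a library-free sketch of the same store-and-verify flow using the JDK's PBKDF2. The class name, the "iterations:salt:hash" storage format, and the iteration count are assumptions made for this illustration, not the format Spring Security uses.

```java
import java.security.GeneralSecurityException;
import java.security.MessageDigest;
import java.security.SecureRandom;
import java.util.Base64;
import javax.crypto.SecretKeyFactory;
import javax.crypto.spec.PBEKeySpec;

// Hypothetical sketch; the storage format "iterations:salt:hash" is an assumption.
final class PasswordStore {
    private static final int ITERATIONS = 10_000; // illustrative; tune much higher in practice
    private static final int KEY_BITS = 256;

    private static byte[] pbkdf2(char[] password, byte[] salt, int iterations, int keyBits) {
        PBEKeySpec spec = new PBEKeySpec(password, salt, iterations, keyBits);
        try {
            return SecretKeyFactory.getInstance("PBKDF2WithHmacSHA256")
                    .generateSecret(spec)
                    .getEncoded();
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException("PBKDF2 is not available", e);
        } finally {
            spec.clearPassword();
        }
    }

    /** Hashes a password with a fresh random salt and returns the string to store. */
    static String hash(char[] password) {
        byte[] salt = new byte[16];
        new SecureRandom().nextBytes(salt);
        byte[] key = pbkdf2(password, salt, ITERATIONS, KEY_BITS);
        Base64.Encoder encoder = Base64.getEncoder();
        return ITERATIONS + ":" + encoder.encodeToString(salt) + ":" + encoder.encodeToString(key);
    }

    /** Recomputes the hash with the stored salt and iteration count, then compares. */
    static boolean verify(char[] password, String stored) {
        String[] parts = stored.split(":");
        if (parts.length != 3) {
            return false;
        }
        try {
            int iterations = Integer.parseInt(parts[0]);
            byte[] salt = Base64.getDecoder().decode(parts[1]);
            byte[] expected = Base64.getDecoder().decode(parts[2]);
            byte[] actual = pbkdf2(password, salt, iterations, expected.length * 8);
            // MessageDigest.isEqual compares in constant time for equal-length arrays
            return MessageDigest.isEqual(expected, actual);
        } catch (IllegalArgumentException e) {
            return false; // malformed stored value
        }
    }

    public static void main(String[] args) {
        String stored = hash("Password123".toCharArray());
        System.out.println("stored value:    " + stored);
        System.out.println("correct password: " + verify("Password123".toCharArray(), stored));
        System.out.println("wrong password:   " + verify("password123".toCharArray(), stored));
    }
}
```

Storing the salt and iteration count next to the hash lets the verification step rebuild exactly the same computation, and it allows the iteration count to be raised later for new hashes without breaking existing ones.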

Thank you for taking the time to read this post. I hope that you found it informative and useful in your own development work.