A Comparative Study on Bayesian Optimization Algorithm for Nutrition Problem

Veiw figure View Table

Table 2. Nutritional composition of foods (100 grams) for breakfast

Download as

Veiw figure View Table

We created objective function with the help of the Eq.1 is as follows.

In this problem, we will be looking for the values of X. The other values are provided as input. The problem constraints can then be written as follows. Coefficients are corresponding to 1 gram of the constraint and objective function

2.1.2. Constraints

Daily minimum standards of nutritional elements for Turkey (Paker, 1996) are given in the Table 3.

Table 3. Daily minimum standards of nutritional elements for Turkey

Download as

Veiw figure View Table

Our model is set up for breakfast in the morning, the weighted averages (thought to be three meals a day) 1/3 per cent constants were determined on the right-hand side. For example; energy’s the weighted average value is 2246 calorie for a day and we used it for a breakfast.

We created constraints with the help of the Eq.2, Eq.3, Eq.4 and Eq.5 are as follows.

The minimum energy requirements for breakfast is set at 750 calorie per day.

The minimum protein requirements for breakfast is set at 23 gram per day.

The minimum oil requirements for breakfast is set at 24 gram per day.

The minimum carbohydrate requirements for breakfast is set at 109 gram per day.

The minimum calcium requirements for breakfast is set at 201 miligram per day.

The minimum iron requirements for breakfast is set at 5 miligram per day.

The maximum potassium requirements for breakfast is set at 1167 miligram per day.

The maximum sodium requirements for breakfast is set at 800 miligram per day.

The maximum Vitamin A requirements for breakfast is set at 1495 per day.

The minimum Vitamin C requirements for breakfast is set at 21 per day.

and the other constraints as follows:

The objective function is the value that we are trying to minimize. In our problem, the objective function is the cost of the entire foods. In our nutrition problem, the number of generations is fixed (up to 100), and the target is to provide effective and adequate nutrition.

2.2. Genetic Algorithm for Nutrition Problem

Genetic Algorithm is a random search algorithm that provides a robust method for searching for the optimum solution to complex problems (Goldberg,1989). In a GA, the problem is represented by a population of strings (or chromosomes, in biological terminology). Each string comprises a number of blocks, which represent the individual decision variables of the problem (genes). The variables represented in the string can be processed in an evaluation function or fitness function,which is in effect the objective function. Strings are processed and combined according to their fitness (objective function value) in order to generate new strings that contain the best features of two parent strings. Strings with the highest fitness have the greatest chance of contributing to future generations, as in the process of natural selection. Excellent introductions to GAs are given by (Goldberg,1989) and (Michalewicz ,1992).

Three fundamental operators are involved in manipulating strings and moving to a new generation: selection, crossover, and mutation. The approach taken to the operators of selection, crossover, and mutation can influence the results obtained, and different problems may require different approaches.

The main data structures in the Genetic Algorithm toolbox are chromosomes, objective function values and fitness values. The chromosome data structure stores an entire population in a single matrix of size Nind x Lind, where Nind is the number of individuals in the population and Lind is the length of the genotypic representation of those individuals. The decision variables in the genetic algorithm are obtained by applying some mapping from the chromosome representation into the decision variable space. An objective function is used to evaluate the performance of the decision variables in the problem domain. Fitness values are derived from objective function values through a scaling or ranking function.

The genetic algorithm uses three main types of rules at each step to create the next generation from the current population (Schmitt and Lothar, 2001), (Schmitt and Lothar, 2004):

• Selection rules select the individuals, called parents, that contribute to the population at the next generation.

• Crossover rules combine two parents to form children fort he next generation.

• Mutation rules apply random changes to individual parents to form children.

Figure 1. M- File showing defined linear equation

Download as

View current figure in a new window

Figures index

Veiw figure View Figure

View next figure

In our nutrition problem, we used the Matlab – Genetic Algorithm toolbox for computational results of Genetic algorithm. We used “gatool” and minimize the our objective function (cost of the breakfast). We defined the objective function in a seperate m – file as shown in Figure 1.

2.3. Bayesian Optimization Algorithm for Nutrition Problem

This section discusses the proposed Bayesian optimization algorithm for the nutrition problem, including the construction of a Bayesian network.

2.3.1. The Construction of a Bayesian Network

Bayesian networks are also called directed graphical models, in which each node corresponds to one variable, and each variable corresponds to one position in the strings representing the solutions. The relationship between two variables is represented by a directed edge between the two corresponding nodes.

Figure 2. A Bayesian network for breakfast

Download as

View current figure in a new window

Figures index

Veiw figure View Figure

View previous figure

Bayesian networks are often used to model multinomial data with both discrete and continuous variables by encoding the relationship between the variables contained in the modeled data, which represents the structure of a problem. Furthermore, they are used to generate new data instances or variables instances with similar properties as those of given data. Figure 2 is the Bayesian network constructed for the nutrition problem, which is an acyclic directed graph representing the solution structure of the problem. Since a solution has no parents, it will be chosen from nodes according to their probabilities. The next will be chosen from nodes according to the probabilities conditioned on the previous nodes. This building process is repeated until the last node has been chosen. A path from nutrient 1 to nutrient m is thus formed where m is the number of the nutrient, representing a new potential solution. Since all probability values for each nutrient are normalized, we suggest the tournament method as a suitable strategy for rule selection (Goldberg, 1989).

The goal of learning is to find the variable values of all nodes that maximize the likelihood of the training data containing a number of independent cases. Thus, the learning in our case amounts to ‘counting’ based on a multinomial distribution. We used R programming for the Bayesian network. It calculates the conditional probabilities of each possible value for each node given all possible values of its parent nodes. Conditional probabilities as follows in Table 4.

Table 4. Conditional probabilities and value of probability

Download as

Veiw figure View Table

These probability values were used to generate new solutions. Then, the goal function with each decision variable being a probabilistic value corresponding to 1 gram of the objective coefficients were obtained by multiplying the price value.

2.3.2. A Bayesian Optimization Algorithm

This section introduces a Bayesian optimization algorithm for the nutrition problem and based on the estimation of conditional probabilities.

In the Bayesian optimization algorithm, the first population is generated at random. From the current population, a set of better strings is selected. Any selection method biased towards better fitness can be used, and in this paper, tournament selection is applied. The conditional probabilities of each node in the Bayesian network are computed. New strings are generated by using these conditional probability values, and are added into the population, replacing some of the old strings. The steps of the Bayesian optimization algorithm for nutrition problem are:

1. Set , and generate an initial population at random;

2. Use tournament to select a set of promising strings from ;

3. Compute the conditional probabilities of each node according to this set of promising solutions;

4. For the each food, the tournament method is used to select string according to the conditional probabilities of all available nodes, thus obtaining a new string. A set of new strings will be generated in this way;

5. Create a new population by replacing some strings from with and set

6. If the termination conditions are not met (we use 100 generations), go to step 2.

3. Computational Results and Discussions

In this section we describe the computational experiments that were used to test BOA. For all experiments, real data sets as given to us by the TUIK are available. Data set consists of 20 nutrients and their prices. For all data instances, we used the following set of fixed parameters to implement our experiments. These parameters are based on our experience and intuition and thus are not necessarily the best for each instance. We have kept them the same for consistency.

3.1. Details of Algorithms

The detailed computational results over these 20 instances are listed in the tables.

• LP: optimal solutions found with linear programming software (Dowsland and Thompson, 2000);

• GA: optimal solutions found with genetic algorithm software (Matlab(R2009b))

• BOA: optimal solutions found with Bayesian optimization algorithm

• Stopping criterion: number of generations = 100, or an optimal solution is found

• Population size = 100

• The number of solutions kept in each generation = 20

• Number of runs per data instance = 20

• Creation Function = uniform

• Selection Function = tournament

• Crossover Function = two point

• Hybrid Function = pattern search

The BOA was coded in Matlab (R2009b), and all experiments were run on the same Pentium (R) 1.60GHz PC with 1GB RAM using the Windows XP operating system. To test the robustness of the BOA, each data instance was run twenty times by fixing the above parameters and varying the pseudorandom number seed at the beginning.

3.2. Analysis of Results

First, let us see the results obtained from all of three methods.

The results obtained from the linear programming solution WinQSB package program: Bread , Milk , Egg, Margarine , Tomato, Sesame Oil and value of goal function (min.) is 1.4532.

The results obtained from the genetic algorithm solution: Bread , Wiener ., Salami , Milk , White Cheese , Cheddar Cheese , Curd Cheese , Egg , Buttter , Margarine , Tomato , Cucumber , Black Olive , Green Olive , Jam , Sesame Oil , Boiled Grape Juice and value of goal function (min.) is 1.7131.

The results obtained from the Bayesian optimization algorithm solution: Bread , Sausage , White Cheese , Cheddar Cheese , Curd Cheese , Egg , Margarine , Jam , Honey , Sesame Oil , Boiled Grape Juice and value of goal function (min.) is 1.017.

Table 5. All of the results obtained from Linear Programming, Genetic Algorithm and Bayesian Optimization Algorithm

Download as

Veiw figure View Table