IEEE Xplore Full-Text HTML : Conservation of Information in Search: Measuring the Cost of Success

Conservation of Information in Search: Measuring the Cost of Success

Conservation of information theorems indicate that any search algorithm performs, on average, as well as random search without replacement unless it takes advantage of problem-specific information about the search target or the search-space structure. Combinatorics shows that even a moderately sized search requires problem-specific information to be successful. Computers, despite their speed in performing queries, are completely inadequate for resolving even moderately sized search problems without accurate information to guide them. We propose three measures to characterize the information required for successful search: 1) endogenous information, which measures the difficulty of finding a target using random search; 2) exogenous information, which measures the difficulty that remains in finding a target once a search takes advantage of problem-specific information; and 3) active information, which, as the difference between endogenous and exogenous information, measures the contribution of problem-specific information for successfully finding a target. This paper develops a methodology based on these information measures to gauge the effectiveness with which problem-specific information facilitates successful search. It then applies this methodology to various search tools widely used in evolutionary search.

This paper was recommended by Associate Editor Y. Wang.

W. A. Dembski is with the Department of Philosophy, Southwestern Baptist Theological Seminary, Fort Worth, TX 76115 USA (e-mail: wdembski@designinference.com).

R. J. Marks II is with the Department of Electrical and Computer Engineering, Baylor University, Waco, TX 76798 USA (e-mail: robert_marks@baylor.edu).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

¹ This expression is recognized as the Kullback–Leibler distance between the structured search space and the uniform distribution [9].

² The derivation is standard [9], [41], [50]. The number of ways $\{\ell_{n}\vert1\leq n\leq N\}$ objects can be arranged is given by the multinomial coefficient $\vert\omega \vert = L!/\prod_{n = 1}^{N}\ell_{n}!$. For large $M$, from Stirling's formula, $\ln M!\rightarrow M$ $\ln M$ and $\vert\omega \vert\approx e^{L\ln L} \exp(\prod_{n = 1}^{N}\ell_{n}\ln \ell_{n})$. Applying (14) gives $\vert\omega \vert\approx e^{LH_{N}}$ where the entropy is measured in nats. A nat is the unit of information when a natural log is used, for example, in (2). When measured in bits $(\log_{2})$, we obtain (15).

³The conventional notation for the $L$th harmonic number is $H_{L}$. We use ${\cal H}_{L}$ to avoid any confusion with the symbol for entropy.

⁴Deleting the first phrase also increases the active information—but by not as much. If the second phrase is searched using a uniform prior, then the added information for finding the third phrase is $I_{+} = 38\ \hbox{b}$.

1. H. M. Abbas and M. M. Bayoumi

"Volterra-system identification using adaptive real-coded genetic algorithm"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 36, no. 4, pp. 671-684, 2006

Quick Abstract | Show Context | Full Text: PDF

2. S. Agrawal , Y. Dashora , M. K. Tiwari and Y.-J. Son

"Interactive particle swarm: A pareto-adaptive metaheuristic to multiobjective optimization"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 38, no. 2, pp. 258-277, 2008

Quick Abstract | Show Context | Full Text: PDF

3. M. Aigner

Discrete Mathematics

2007, Amer. Math. Soc.

4. J. Bernoulli

"Ars conjectandi (the art of conjecturing)"

Tractatus De Seriebus Infinitis, 1713

5. L. Brillouin

Science and Information Theory

1956, Academic

Show Context

6. J. P. Burg

Maximum entropy spectral analysis

1975

7. S. Christensen and F. Oppacher

"What can we learn from no free lunch? A first attempt to characterize the concept of a searchable"

Proc. Genetic Evol. Comput., pp. 1219-1226, 2001

Show Context

8. T.-Y. Chou , T.-K. Liu , C.-N. Lee and C.-R. Jeng

"Method of inequality-based multiobjective genetic algorithm for domestic daily aircraft routing"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 38, no. 2, pp. 299-308, 2008

Quick Abstract | Show Context | Full Text: PDF

9. T. M. Cover and J. A. Thomas

Elements of Information Theory

2006, Wiley-Interscience

Show Context

10. J. C. Culberson

"On the futility of blind search: An algorithmic view of \'no free lunch\'"

Evol. Comput., vol. 6, no. 2, pp. 109-127, 1998

11. J. D. Cutnell and J. W. Kenneth

Physics

1995, Wiley

12. R. Dawkins

The Blind Watchmaker: Why the Evidence of Evolution Reveals a Universe Without Design

1996, Norton

13. S. Droste , T. Jansen and I. Wegener

"Perhaps not a free lunch but at least a free appetizer"

Proc. 1st GECCO, pp. 833-839,

14. R. C. Eberhart , Y. Shi and J. Kennedy

Swarm Intelligence

2001, Morgan Kaufmann

15. T. M. English

"Some information theoretic results on evolutionary optimization"

Proc. CEC, vol. 1, p. 795, 1999

Quick Abstract | Show Context | Full Text: PDF

16. M. S. Fadali , Y. Zhang and S. J. Louis

"Robust stability analysis of discrete-time systems using genetic algorithms"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 29, no. 5, pp. 503-508, 1999

Quick Abstract | Show Context | Full Text: PDF

17. L. K. Grover

"A fast quantum mechanical algorithm for data search"

Proc. ACM Symp. Theory Comput., pp. 212-219, 1996

18. P. Guturu and R. Dantu

"An impatient evolutionary algorithm with probabilistic tabu search for unified solution of some NP-hard problems in graph and set theory via clique finding"

IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 3, pp. 645-666, 2008

Quick Abstract | Show Context | Full Text: PDF

19. T. D. Gwiazda

Genetic Algorithms Reference

2006, Tomasz Gwiazda

20. J. S. Gyorfi and C.-H. Wu

"An efficient algorithm for placement sequence and feeder assignment problems with multiple placement-nozzles and independent link evaluation"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 38, no. 2, pp. 437-442, 2008

Quick Abstract | Show Context | Full Text: PDF

21. M. J. Healy

"Personal Communication"

The Boeing Company

22. S.-Y. Ho , H.-S. Lin , W.-H. Liauh and S.-J. Ho

"OPSO: Orthogonal particle swarm optimization and its application to task assignment problems"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 38, no. 2, pp. 288-298, 2008

Quick Abstract | Show Context | Full Text: PDF

23. Y.-C. Ho and D. L. Pepyne

"Simple explanation of the no free lunch theorem"

Proc. 40th IEEE Conf. Decision Control, pp. 4409-4414, 2001

Quick Abstract | Show Context | Full Text: PDF

24. Y.-C. Ho , Q.-C. Zhao and D. L. Pepyne

"The no free lunch theorems: Complexity and security"

IEEE Trans. Autom. Control, vol. 48, no. 5, pp. 783-793, 2003

Quick Abstract | Show Context | Full Text: PDF

25. C. A. Jensen , R. D. Reed , R. J. Marks II, M. A. El-Sharkawi , J.-B. Jung , R. T. Miyamoto , G. M. Anderson and C. J. Eggen

"Inversion of feedforward neural networks: Algorithms and applications"

Proc. IEEE, vol. 87, no. 9, pp. 1536-1549, 1999

Quick Abstract | Show Context | Full Text: PDF

26. M. Koppen , D. H. Wolpert and W. G. Macready

"Remarks on a recent paper on the \'no free lunch\' theorems"

IEEE Trans. Evol. Comput., vol. 5, no. 3, pp. 295-296, 2001

Quick Abstract | Full Text: PDF

27. G. Korodi , I. Tabus , J. Rissanen and J. Astola

"DNA sequence compression"

IEEE Signal Process. Mag., vol. 47, no. 1, pp. 47-53, 2007

Quick Abstract | Show Context | Full Text: PDF

28. C.-Y. Lee

"Entropy—Boltzmann selection in the genetic algorithms"

IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 33, no. 1, pp. 138-149, 2003

Quick Abstract | Full Text: PDF

29. R. E. Lenski , C. Ofria , R. T. Pennock and C. Adami

"The evolutionary origin of complex features"

Nature, vol. 423, no. 6936, pp. 139-144, 2003

30. Y. Li and C. O. Wilke

"Digital evolution in time-dependent fitness landscapes"

Artif. Life, vol. 10, no. 2, pp. 123-134, 2004

31. B. Liu , L. Wang and Y.-H. Jin

"An effective PSO-based memetic algorithm for flow shop scheduling"

IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 37, no. 1, pp. 18-27, 2007

Quick Abstract | Show Context | Full Text: PDF

This paper proposes an effective particle swarm optimization (PSO)-based memetic algorithm (MA) for the permutation flow shop scheduling problem (PFSSP) with the objective to minimize the maximum completion time, which is a typical non-deterministic polynomial-time (NP) hard combinatorial optimization problem. In the proposed PSO-based MA (PSOMA), both PSO-based searching operators and some special local searching operators are designed to balance the exploration and exploitation abilities. In particular, the PSOMA applies the evolutionary searching mechanism of PSO, which is characterized by individual improvement, population cooperation, and competition to effectively perform exploration. On the other hand, the PSOMA utilizes several adaptive local searches to perform exploitation. First, to make PSO suitable for solving PFSSP, a ranked-order value rule based on random key representation is presented to convert the continuous position values of particles to job permutations. Second, to generate an initial swarm with certain quality and diversity, the famous Nawaz-Enscore-Ham (NEH) heuristic is incorporated into the initialization of population. Third, to balance the exploration and exploitation abilities, after the standard PSO-based searching operation, a new local search technique named NEH_1 insertion is probabilistically applied to some good particles selected by using a roulette wheel mechanism with a specified probability. Fourth, to enrich the searching behaviors and to avoid premature convergence, a simulated annealing (SA)-based local search with multiple different neighborhoods is designed and incorporated into the PSOMA. Meanwhile, an effective adaptive meta-Lamarckian learning strategy is employed to decide which neighborhood to be used in SA-based local search. Finally, to further enhance the exploitation ability, a pairwise-based local search is applied after the SA-based search. Simulation results based on benchmarks demonstrate the effectiveness of- - the PSOMA. Additionally, the effects of some parameters on optimization performances are also discussed

Full Abstract

32. D. J. C. MacKay

Information Theory, Inference and Learning Algorithms

2002, Cambridge Univ. Press

33. R. J. Marks II

Handbook of Fourier Analysis and Its Application

2009, Oxford Univ. Press

34. A. Papoulis

Probability, Random Variables, and Stochastic Processes

pp. 537-542, 1991, McGraw-Hill

Show Context

35. U. S. Putro , K. Kijima and S. Takahashi

"Adaptive learning of hypergame situations using a genetic algorithm"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 30, no. 5, pp. 562-572, 2000

Quick Abstract | Show Context | Full Text: PDF

36. R. D. Reed and R. J. Marks II

Neural Smithing: Supervised Learning in Feedforward Artificial Neural Networks

1999, MIT Press

37. K. Takita and Y. Kakazu

"Automatic agent design based on gate growth-application to wall following problem"

Proc. 37th Int. Session Papers SICE Annu. Conf., pp. 863-868, 1998

Quick Abstract | Show Context | Full Text: PDF

38. F. Rojas , C. G. Puntonet , M. Rodriguez-Alvarez , I. Rojas and R. Martin-Clemente

"Blind source separation in post-nonlinear mixtures using competitive learning, Simulated annealing, and a genetic algorithm"

IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 34, no. 4, pp. 407-416, 2004

Quick Abstract | Show Context | Full Text: PDF

39. C. Schaffer H. Hirsh and W. W. Cohen

"A conservation law for generalization performance"

Proc. 11th Int. Conf. Mach. Learn., pp. 259-265, 1994

40. T. D. Schneider

"Evolution of biological information"

Nucleic Acids Res., vol. 28, no. 14, pp. 2794-2799, 2000

41. C. E. Shannon

"A mathematical theory of communication"

Bell Syst. Tech. J., vol. 27, pp. 379-423, 1948

42. R. Srinivasan

Importance Sampling

2002, Springer-Verlag

43. B. Weinberg and E. G. Talbi

"NFL theorem is unusable on structured classes of problems"

Proc. CEC, vol. 1, pp. 220-226, 2004

Quick Abstract | Show Context | Full Text: PDF

44. A. M. Turing

"On computable numbers with an application to the entscheidungs problem"

Proc. Lond. Math. Soc. Ser. 2, vol. 42, pp. 230-265, 1936

45. G. S. Tewolde and W. Sheng

"Robot path integration in manufacturing processes: Genetic algorithm versus ant colony optimization"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 38, no. 2, pp. 278-287, 2008

Quick Abstract | Show Context | Full Text: PDF

46. D. Wolpert and W. G. Macready

"No free lunch theorems for optimization"

IEEE Trans. Evol. Comput., vol. 1, no. 1, pp. 67-82, 1997

Quick Abstract | Show Context | Full Text: PDF

47. D. H. Wolpert and W. G. Macready

"Coevolutionary free lunches"

IEEE Trans. Evol. Comput., vol. 9, no. 6, pp. 721-735, 2005

Quick Abstract | Full Text: PDF

48. E. V. Wright

Gadsby

1997, Lightyear Press

49. Y.-C. Xu and R.-B. Xiao

"Solving the identifying code problem by a genetic algorithm"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 37, no. 1, pp. 41-46, 2007

Quick Abstract | Show Context | Full Text: PDF

50. H. P. Yockey

Information Theory, Evolution, and the Origin of Life

2005, Cambridge Univ. Press

51. F. Yu , F. Tu and K. R. Pattipati

"A novel congruent organizational design methodology using group technology and a nested genetic algorithm"

IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 36, no. 1, pp. 5-18, 2006

Quick Abstract | Show Context | Full Text: PDF

Given simple agent rules, a swarm's emergent behavior can be difficult to predict. The inverse problem is even more difficult: Given a desired emergent behavior, what are the rules by which swarm agents should operate? Disjunctive fuzzy control is proposed as a method to model swarm agents. Compared to more commonly used conjunctive fuzzy control such as that proposed by Mamdani, disjunctive fuzzy control is robustly fault tolerant and disjointly connected. Swarms are inherently disjunctive. Instead of agents working in coordination with one another, each swarm agent contributes individually to the result. The disjunctive attribute can also be applied at the sensor level for each individual agent. Disjunctive control allows adaptation of the describing membership function, as is commonly done in conjunctive control. The inversion process is illustrated with numerous simulation examples, including a predator-prey game, gang warfare, and escaping agents. The swarm is instructed what to do but not how to do it. Imposition of fitness constraints and repeated generations of evolutionary molding of agent performance can then result in unexpected emergent behaviors of the swarm, e.g., use of decoys, self-sacrifice, flanking maneuvers, and shielding of the weak.

Full Abstract

Conservation of information (COI) popularized by the no free lunch theorem is a great leveler of search algorithms, showing that on average no search outperforms any other. Yet in practice some searches appear to outperform others. In consequence, some have questioned the significance of COI to the performance of search algorithms. An underlying foundation of COI is Bernoulli's Principle of Insufficient Reason(PrOIR) which imposes of a uniform distribution on a search space in the absence of all prior knowledge about the search target or the search space structure. The assumption is conserved under mapping. If the probability of finding a target in a search space is p, then the problem of finding the target in any subset of the search space is p. More generally, all some-to-many mappings of a uniform search space result in a new search space where the chance of doing better than p is 50-50. Consequently the chance of doing worse is 50-50. This result can be viewed as a confirming property of COI. To properly assess the significance of the COI for search, one must completely identify the precise sources of information that affect search performance. This discussion leads to resolution of the seeming conflict between COI and the observation that some search algorithms perform well on a large class of problems.

Full Abstract

According to conservation of information theorems, performance of an arbitrarily chosen search, on average, does no better than blind search. Domain expertise and prior knowledge about search space structure or target location is therefore essential in crafting the search algorithm. The effectiveness of a given algorithm can be measured by the active information introduced to the search. We illustrate this by identifying sources of active information in Avida, a software program designed to search for logic functions using nand gates. Avida uses stair step active information by rewarding logic functions using a smaller number of nands to construct functions requiring more. Removing stair steps deteriorates Avida's performance while removing deleterious instructions improves it. Some search algorithms use prior knowledge better than others. For the Avida digital organism, a simple evolutionary strategy generates the Avida target in far fewer instructions using only the prior knowledge available to Avida.

Full Abstract

Baylor University

Conservation of Information in Search: Measuring the Cost of Success

IEEE Keywords

INSPEC: Controlled Indexing

INSPEC: Non-Controlled Indexing

Authors Keywords

INFORMATION AS A COST OF SEARCH

MEASURING ACTIVE INFORMATION

EXAMPLES OF ACTIVE INFORMATION IN SEARCH

A. Repeated Queries

B. Subset Search

C. Importance Sampling

Example 1

D. FOO

1) Searching a Type Class

E. Partitioned Search

F. Random Mutation

1) Choosing the Fittest of a Number of Mutated Offspring

2) Optimization by Mutation

3) Optimization by Mutation With Elitism

G. Stair-Step Search

CRITIQUING EVOLUTIONARY-SEARCH ALGORITHMS

A. Monkey at a Typewriter

CONCLUSION

Footnotes

References

Authors

Cited By

Cited by IEEE

Cited by Other Publishers

Keywords

IEEE Keywords

INSPEC: Controlled Indexing

INSPEC: Non-Controlled Indexing

Authors Keywords

Corrections

Multimedia

Text Size

Related Articles

IEEE Account

Purchase Details

Profile Information

Need Help?