The k-Language Classification , a Proposed New Theory for Image Classification and Clustering at Pixel Level

This theory attempted to explore the possibility of using regular language further in image analysis, departing from the use of string to represent the region in the image. But we are not trying to show an alternative idea about how to generate a string region, where there are many different ways how the image or region produces strings representing, in this paper we propose a way how to generate regular language or group of languages which performs both classify the set of strings generated by a group of a number of image regions. Researchers began by showing a proof that there is always a regular language that accepts a set of strings that produced the image, and then use the language to perform the classification. Research then expanded to the pixel level, on whether the regular language can be used for clustering pixels in the image, the researchers propose a systematic solution of this question. As a tool used to explore regular language is deterministic finite automata. On the end part before conclusion of this paper, we add revision version of this theory. There is another point of view to revision version, added for make this method more precision and more powerfull from before. Keywords; pixel, region, image, regular language, deterministic finite automata


Introduction
This paper is essentially all of the ideas that emerged in the course of digital image processing is followed by researchers, inspired by the contents of the lecture very interesting for researchers so that so many questions that researchers ask in it and enlightenment as well as the inspiration to write it in the results of intellectual contemplation a concept, if not too much, researchers are trying to formulate a theory that tried to summarize the results of such reflection.Researchers apologize if none use arguments (a theorem or standard definition) derived from previous research papers or textbooks before, all definitions, theorems and consequences expressed in this paper are solely the proposal of the theory that describes the research foundation researchers are trying to put forward independently.Therefore there are no citations used in this paper except the bibliography used solely as a research literature course.
By the way, because it is not directly based on the arguments of others explicitly, researchers tried to be guided by the logic of the system (the calculus of proposition and predicate calculus) and mathematical sets to validate any deductive statements expressed independently.May thus be a pretty solid foundation for the theory proposed.
We call this theory as a k-language, as a form of naming is commensurate with k-mean.both have a very different perspective, the k-mean using a set of numbers in the way it works, the k-language using a set of string.But even so they are supervised classification.
This theory proposed subsequent decline can begin as follows: Discussion of this idea about using regular language to perform up to the level of pixels clustering performed using deterministic finite state automata (DFA), in which a regular language has an equivalent representation with a DFA so simply by using the DFA, all the analysis and construction of regular languages can be done.
On Digital Image Processing lecture, noted that the region basically be represented by a string, so on an image of n-region can be obtained n strings.Region inside the classification process is done by defining a distance metric that measures the similarity between strings.For whether it is a set of n strings that may be forming a certain language that may be based on the language that one can perform analysis and classification In the first part of this paper argued that there is always a regular language that accepts an arbitrary finite set of strings, then it can be constructed based on a language or some regular languages that can be used to analyze an image linguistically and also perform classification.

Construction of regular language from a digital image
In successive paragraphs below, researchers first proposed a theorem that states that if we have an arbitrary finite set of string then there will always be at least one regular language can be built and use the string set, which in its DFA representation means there is a minimal DFA automata who received the entire string.This proposal is the following theorem:

Theorem 1 (Existence of Regular Languages):
Suppose there is a finite set of arbitrary G string where the entire alphabet is from the set of alphabet E, then at least there is a regular language L which can be constructed for the entire string in the set G. Proof: Because for any regular language L with a finite string can always be made an equivalent DFA, then take M ={S,E, d, s0, F} as automata equivalence of regular languages L. where : S = the entire set of states E = any alphabet set of L d = a mapping is defined as follows d : E x S S s0 = initial state F = the entire set of final states Then choose F = S, Then the automaton M accepts any string of any combination of the alphabet in E.
Suppose there is a string of any combination of alphabet Y in E, ie Y ∈ E *, and is not accepted by M, then Y will end or finish in certain state e.g.A where A is not a member of F, i.e.A ∉ F.
But A definite elements S, A ∈ S, because S is a universal set of states, this means that S is not equal to F, ie S ≠ F. This conclusion contradicts our choice for F, i.e F = S. Thus the evidence presented.
Based on the theorem that the researchers propose urther research proposes the following consequences:

Result 1:
Suppose that an image with n region in it, then the whole region was represented by a string respectively, then the image can be constructed in a certain regular language that accepts the set of strings formed by the entire region.
Proof: By theorem 1 is trivial.

Result 2:
Suppose that an image with n pixels making up, and each pixel can be represented by a string, then the image can be constructed in a certain regular language that accepts all strings generated by the pixels in the image.
Proof: By theorem 1 is trivial.
Thus these effects are proposed.As for how a pixel to produce a string, this being a separate ideas to be put forward at the end of the research paper, so it is hoped regular language can be used to perform clustering point on the image.
Further noted was about ideas on how to order the regular language L that has ensured its existence there by theorem 1 can be used to perform classification.

Regular language decomposition for classification
For the purposes of this classification, the researchers first put forward an idea of the decomposition of regular languages.This idea is written exactly in the proposed definition as follows: Definition 1 (Decomposition language): Suppose a regular language L is equivalent to automata M = {S, E, d, s0, F}, written L ≡ M, then the decomposition of L written L = L 1 ⊕L 2 ⊕L 3 ⊕...⊕L n .where L i ≡ M i , M i = {S,E, d, s0, F i } and F i ∩F j =∅, i,j = 1,2,3,...,n.i≠j.
By definition 1, the researchers suggest this may further stated the opinion that, an image which k region can basically be classified into the decomposition of regular languages L which by Theorem 1 is exist and can be constructed thereon.

Example:
Suppose that an image G with 3 region in it, each can be represented in the following strings: Region1 = aaabb Region2 = ccab Region3 = cacabb Create an automaton M = {S, E, d, s0, F} where: S = {A, B, C}, E = {a, b, c}, s0 = A, and selected F = S, and d can be chosen as follows: Region classification in the image of G is based on the decomposition of L is obtained: Similarly, regions classification process of the image is based on the idea of decomposition is proposed language researchers.
Next on the following ideas, on how to perform image classification at a global level, which if given n image, how do image classification based on regular languages EPJ Web of Conferences that can be constructed on each image?
The first researchers to propose a terminology, the concept of the dominant language.A language is dominant on an image if and only if he receives more regions in the image.This terminology researchers propose in the form of the following definition: Definition 2 (Dominant Language): Suppose L is a regular language that is constructed over an image of G, where L can be decomposed into L = L 1 ⊕L 2 ⊕L 3 ⊕...⊕L n , if the classification of all the regions in the image of G causes card (L i ) = max (card (L 1 ), card (L 2 ), card (L 3 ), ..., card (L n )), i = 1,2,3, ..., n then L i expressed as the dominant language in the image of G.
Furthermore, researchers argue that two images said to a group if only if both have the same dominant language.This way is the first to build a language L by Theorem 1 that receives the entire region in the whole image.This group proposed the idea in the form of the following definition: Definition 3 (the way for classification): Given the number of images, which can be built a language L is accepted throughout the region on the whole image.Two images in a language said to a group (have the same common language) if and only if they have the same dominant language.Examples of the use of this idea: Suppose there are images G 1 ,G 2 ,G 3 , ... , G n , and created a language L is accepted throughout the region on the whole image of G 1 ,G 2 ,G 3 , ... , G n , corresponding theorem 1, for example from decomposition and selection to every region throughout the images G 1 ,G 2 ,G 3 , ... , G n , found that the dominant language of G 3 , G 5 is L 5 the same as the language of the dominant of G 7 , the classification results obtained G 3 ,G 5 ,G 7 ∈ L 5 .
Thus the idea was proposed as a new way to classify the image globally.
Later in the following chapter the researcher did propose a way on how to do clustering at the level of pixels in an image.

Proposed ideas on how to do clustering at pixel level
Using of regular language at the level of pixels of the image leads to a new question, namely how to represent how a pixel with a string?
At below researchers propose an idea on how to represent a pixel with a string, or how a pixel to produce a string.
The first time will be seen that 0,1,2,3,4,5,6,7,8,9 is an alphabet, and therefore does not have the meaning of arithmetic and ordinal value.
The second will be seen that the mapping function that maps a pixel value for a pixel quantization values or grayscale value of 0-255 is a string function.i.e. a function that maps a string of pixels with a combination of alphabet {0,1,2,3,4,5,6,7,8,9}.So it is no longer seen as a function of numbers (arithmetic).This idea was proposed in the form of the following definition:

Definition 4 (string function of pixels):
Let E = {0,1,2,3,4,5,6,7,8,9} is an alphabet and combinations thereof are seen as a string (not a number), and an image of G is also set pixels then a mapping f that maps are one to one from a pixel to a combination of the alphabet, is written: f : G E*, called a pixel string functions.Thus it can be seen that the fundamental has a pixel can produce or represented by a string.
Until this proposal, other researchers offer another way to represent a pixel with a string.This proposal is based on the nature neighborhoods of each pixel in the image.
The proposal put forward is exactly the definition 4.a as follows: Definition 4.a Suppose A i is an interval of integers in the interval [0,255], i = 1,2,3, ..., n where A i ≠ ∅ and A i ∩A j =∅ for i ≠ j and an alphabet E = {a 1 ,a 2 ,a 3 ,...,a n }, the value of the alphabet A i is expressed by the following function: Suppose g is a pixel in the image of G, and A i is an interval of integers in the interval [0,255], i, j = 1,2,3, ..., n where A i ≠ ∅ and A i ∩A j =∅ for i ≠ j where value of g ∈ A i and an alphabet E = {a 1 ,a 2 ,a 3 ,...,a n and there is a mapping f : A i a i , then the pixel value g alphabet is expressed by a i .
Furthermore, based on the definition 4.a and 4.b can be argued that the idea of the neighboring string functions declared in the definition 5.

Definisi 5 (neighboring string functions):
Suppose g is a pixel in the image of G, and Np is the neighborhoods p on g and A i are intervals in [0,255 i = 1,2,3, ..., n where A i ≠ ∅ and A i ∩A j =∅ for i ≠ j and an alphabet E = {a 1 ,a 2 ,a 3 ,...,a n } then g string value based on Np expressed by mapping f Np : f Np : g b 1 b 2 b 3 ....b p b p+1 where b i ∈ E is the alphabet value of neighboring pixels to i, p i ∈ Np i=1,2,3,...p , and b p+1 is the value of alphabet g, whereas the value of the alphabet starts at left corner pixel over to the right, and apply: f : A i a i , E = {a 1 ,a 2 ,a 3 ,...,a n } Example: Suppose g is a pixels in the image G and is have value 27 in grayscale.Create a grayscale intervals as follows: Thus this proposal can be used to perform clustering of pixels in an image that is in the following way:

Examples of clustering in an image by using regular language
Suppose given an image of G, Theorem 1 can be constructed by a regular language that accepts all pixels in the image.Choose L where L ≡ M has the following description: M = {S,E, d, s0, F} where S = {A 1 ,A 2 ,A 3 ,...,A 10 } the number of states can be selected at will and are not restricted as in the previous theorem 1, and E = {0,1,2,3,4,5,6,7,8,9}, d is chosen so that d : E x S S , for example have been d as follows: and selected F = S, and s0 =A 1 .Further decomposition can be done at will against the desired L origin meets the definition 1 of the decomposition, suppose the desired decomposition in 10 languages.Then all the pixels in the image can be clustered in 10 clusters.namely L=L 1 ⊕L 2 ⊕L 3 ⊕...⊕L 10 , where L i ≡ M i ={S,E, d, s0, F i } and F i = {A i }, i = 1,2,3,4,5,6,7,8,9,10.
Clustering in 10 clusters can then be performed on G for each pixel in G, based on the previous definitions, researchers found three ways that can be selected to perform clustering, these three ways are as follows: 1.Using the definition 4 of string functions defined on G.
Suppose : f(Pixel1) = 233 f(Pixel2) = 143 f(Pixel3) = 20 f(Pixel4) = 0 by passing on automata M, obtained clustering result as follows: Next is an example of using the definition 5 of the neighbors string functions defined above G for each point p ∈ G. M = {S, E, d, s0, F} where S = {A 1 ,A 2 ,A 3 ,...,A 10 } the number of states can be selected at will and are not restricted as in the previous theorem 1, and E = {a, b, c, d, e, g, h, i, j, k}, d is chosen so that d : E x S S d chosen for this example as follows: and selected F = S, and s0 =A 1 .Further decomposition can be done at will against the desired L origin meets the definition 1 of the decomposition, suppose the desired decomposition in 10 languages.Then all the pixels in the image can be clustered in 10 clusters.namely L=L 1 ⊕L 2 ⊕L 3 ⊕...⊕L 10 , where .
Find the dominant language of the most widely accepted of neighboring pixels p ∈ G, with the view that each pixel neighbors of p produces a string by definition 4, the function string pixels, and that two pixels are in one cluster if only if it has the same dominant language on the neighbors.This idea can be formulated in the form of researchers following definition: Definisi 6 (Similarity between pixels in a regular language): For example an image of G, and p1, p2 ∈ G, and Np 1 , Np 2 respectively neigbourhood of p1, p2.Both are said to have a common language if and only if the dominant language on the Np 1 is the same as the dominant language in Np 2 .
Therefore, based on the dominant language similarity, clustering in image G can be done.Thus the proposed ideas.

The revision version of theory
This part is a revision of our theory about K-Language, based on critics in reviews by reviewer.
We try to perform some another aproach to the theory so we hope the theory get precision and more powerfull.
We begun idea by construct a language band, that is a band of grayscale gradient of pixel.

Definition 7 (a language band):
A language band is a set of regular languages LB construct by a bijection function f that mapping every value in grayscale gradient to a set of regular languages LB.
The number of alfabet is arbitrary, where we choose the alfabet is {a,b,c,d,e,g,h,i,j,k}.
The next, we define a concept about a channel language.Definition 8 (a channel language): L is a channel language if L is a subset of LB.
By definition 8,our meaning of an understanding about k-language is k-channel of language from a language band of grayscale.Example if we consider there are 5 region in image that we want to classified, means that there are 5 channel of language must use to classify every pixel in image.
The example for this definition: Suppose we have channels C1 and C2, we define C1 and C2 as: The channel language is a dominant language of pixel.12. Then the pixel is an element of region with that dominant language.13.END.
Explanation in this paper indicate that the proposed theory researchers, could consistently perform classification at a global level of the image and also can perform up to the level of clustering pixels of the image, and all that remains consistent with the foundations being built and proposed the theory itself.
Researchers are very interested to develop this theory further proposed, based on the new questions that researchers found, that is, if the image is seen as merely a set of strings, it means that the image is no longer rooted in the quantitative significance or meaning of arithmetic, in which each pixel has a value seen Quantitative pixels (gray value), if so the meaning of the image is now rooted?Researchers noticed that the image must necessarily be rooted in semantic meanings and the meanings of logic.But what is the logic of the system that stands above the regular language?Is not our logic system is basically built on natural language (eg English) that are at the top hierarchy of language Chomsky classification.
Researchers took the decision to develop a system of regular logic-based language and hoped that the system is trying to develop a logic that researchers generally applies to all languages in the Chomsky hierarchy.
Research on this has started to develop a research-based preposition calculus of regular languages, as well as researchers hope to predicate calculus, researchers tried to advance the foundations of a new logic that is different from the normal logical foundation before, and perhaps at other times researchers can propose ideas about this.
Similarly, researchers looked at the possibility of applying the ideas in this paper on the problem set of agents that communicate with each other, to see that every agent produces a set of strings that can be accepted by another agent.In this case, researchers have also started trying to develop the assumption that an agent is an automaton.
This paper was written thus may be useful.We are sorry if there are typos in the writing symbols.